docs: rename article slug
6
.gitignore
vendored
@@ -1,13 +1,7 @@
/.vscode
/_site
/.sass-cache
/node_modules/
/npm-debug.log
/.jekyll-metadata
/images/crushed
/.bundle
/vendor/bundle/

.DS_Store
Thumbs.db
db.json
2
LICENSE
@@ -1,6 +1,6 @@
The MIT License (MIT)

Copyright (c) 2020 Yasuaki Uechi (https://uechi.io)
Copyright (c) 2021 Yasuaki Uechi (https://uechi.io)

Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
@@ -1,6 +1,6 @@
# uechi.io

[](https://travis-ci.org/uetchy/uechi.io)
[](https://github.com/uetchy/uechi.io/actions?query=pages)

## Build

@@ -8,12 +8,6 @@
yarn build
```

## Proofread

```
yarn test
```

## Publish

```
25
_config.yml
@@ -65,6 +65,15 @@ marked:
  prependRoot: true
  postAsset: true

# Math
mathjax:
  tags: none # or 'ams' or 'all'
  single_dollars: true # enable single dollar signs as in-line math delimiters
  cjk_width: 0.9 # relative CJK char width
  normal_width: 0.6 # relative normal (monospace) width
  append_css: true # add CSS to pages rendered by MathJax
  every_page: true # if true, every page will be rendered by MathJax regardless the `mathjax` setting in Front-matter

# Highlight
highlight:
  enable: true
@@ -117,21 +126,9 @@ ignore:
# Extensions
## Plugins: https://hexo.io/plugins/
## Themes: https://hexo.io/themes/
theme: vanilla

# Deployment
## Docs: https://hexo.io/docs/one-command-deployment
deploy:
  type: "github_pages"

mathjax:
  tags: none # or 'ams' or 'all'
  single_dollars: true # enable single dollar signs as in-line math delimiters
  cjk_width: 0.9 # relative CJK char width
  normal_width: 0.6 # relative normal (monospace) width
  append_css: true # add CSS to pages rendered by MathJax
  every_page: true # if true, every page will be rendered by MathJax regardless the `mathjax` setting in Front-matter
theme: uio

# Feed
feed:
  type: atom
  path: feed.xml
@@ -2,15 +2,16 @@
title: Toxicity Analysis in Vtuber Live Chat
---

A little experiment on classifying toxic comments, and how I ended up from GBDT to Transformer-based model.
A little exploration and experiment on classifying toxic comments.

# Why

The motivation is simple; I just feel sad when they look suffered from toxic comments in live chats. The goal is also simple: design an automated system to spot toxic comments and quarantine them.
The motivation is simple; I just feel sad when they look suffered from toxic comments in live chats. The goal is also simple: design an automated system to spot toxic comments and destroy them.

# Data Data Data

> I can't make bricks without clay.
> — Sherlock Holmes

I need a myriad of live chat comments and moderation events for analysis and future use.
@@ -86,13 +87,13 @@ Here's a [t-SNE](https://en.wikipedia.org/wiki/T-distributed_stochastic_neighbor

# Omake

## Hololive-10M Dataset
## Hololive Dataset

I made collected chat events publicly available for those interested in further research.

The dataset contains:

- Chats - 10 million
- Chats
- Superchats (amount, currency)
- Retraction events
- Moderation events (ban, delete)