docs: rename article slug
6
.gitignore
vendored
@ -1,13 +1,7 @@
|
|||||||
/.vscode
|
/.vscode
|
||||||
/_site
|
|
||||||
/.sass-cache
|
/.sass-cache
|
||||||
/node_modules/
|
/node_modules/
|
||||||
/npm-debug.log
|
/npm-debug.log
|
||||||
/.jekyll-metadata
|
|
||||||
/images/crushed
|
|
||||||
/.bundle
|
|
||||||
/vendor/bundle/
|
|
||||||
|
|
||||||
.DS_Store
|
.DS_Store
|
||||||
Thumbs.db
|
Thumbs.db
|
||||||
db.json
|
db.json
|
||||||
|
2
LICENSE
@ -1,6 +1,6 @@
|
|||||||
The MIT License (MIT)
|
The MIT License (MIT)
|
||||||
|
|
||||||
Copyright (c) 2020 Yasuaki Uechi (https://uechi.io)
|
Copyright (c) 2021 Yasuaki Uechi (https://uechi.io)
|
||||||
|
|
||||||
Permission is hereby granted, free of charge, to any person obtaining a copy
|
Permission is hereby granted, free of charge, to any person obtaining a copy
|
||||||
of this software and associated documentation files (the "Software"), to deal
|
of this software and associated documentation files (the "Software"), to deal
|
||||||
|
@ -1,6 +1,6 @@
|
|||||||
# uechi.io
|
# uechi.io
|
||||||
|
|
||||||
[](https://travis-ci.org/uetchy/uechi.io)
|
[](https://github.com/uetchy/uechi.io/actions?query=pages)
|
||||||
|
|
||||||
## Build
|
## Build
|
||||||
|
|
||||||
@ -8,12 +8,6 @@
|
|||||||
yarn build
|
yarn build
|
||||||
```
|
```
|
||||||
|
|
||||||
## Proofread
|
|
||||||
|
|
||||||
```
|
|
||||||
yarn test
|
|
||||||
```
|
|
||||||
|
|
||||||
## Publish
|
## Publish
|
||||||
|
|
||||||
```
|
```
|
||||||
|
25
_config.yml
@ -65,6 +65,15 @@ marked:
|
|||||||
prependRoot: true
|
prependRoot: true
|
||||||
postAsset: true
|
postAsset: true
|
||||||
|
|
||||||
|
# Math
|
||||||
|
mathjax:
|
||||||
|
tags: none # or 'ams' or 'all'
|
||||||
|
single_dollars: true # enable single dollar signs as in-line math delimiters
|
||||||
|
cjk_width: 0.9 # relative CJK char width
|
||||||
|
normal_width: 0.6 # relative normal (monospace) width
|
||||||
|
append_css: true # add CSS to pages rendered by MathJax
|
||||||
|
every_page: true # if true, every page will be rendered by MathJax regardless the `mathjax` setting in Front-matter
|
||||||
|
|
||||||
# Highlight
|
# Highlight
|
||||||
highlight:
|
highlight:
|
||||||
enable: true
|
enable: true
|
||||||
@ -117,21 +126,9 @@ ignore:
|
|||||||
# Extensions
|
# Extensions
|
||||||
## Plugins: https://hexo.io/plugins/
|
## Plugins: https://hexo.io/plugins/
|
||||||
## Themes: https://hexo.io/themes/
|
## Themes: https://hexo.io/themes/
|
||||||
theme: vanilla
|
theme: uio
|
||||||
|
|
||||||
# Deployment
|
|
||||||
## Docs: https://hexo.io/docs/one-command-deployment
|
|
||||||
deploy:
|
|
||||||
type: "github_pages"
|
|
||||||
|
|
||||||
mathjax:
|
|
||||||
tags: none # or 'ams' or 'all'
|
|
||||||
single_dollars: true # enable single dollar signs as in-line math delimiters
|
|
||||||
cjk_width: 0.9 # relative CJK char width
|
|
||||||
normal_width: 0.6 # relative normal (monospace) width
|
|
||||||
append_css: true # add CSS to pages rendered by MathJax
|
|
||||||
every_page: true # if true, every page will be rendered by MathJax regardless the `mathjax` setting in Front-matter
|
|
||||||
|
|
||||||
|
# Feed
|
||||||
feed:
|
feed:
|
||||||
type: atom
|
type: atom
|
||||||
path: feed.xml
|
path: feed.xml
|
||||||
|
@ -2,15 +2,16 @@
|
|||||||
title: Toxicity Analysis in Vtuber Live Chat
|
title: Toxicity Analysis in Vtuber Live Chat
|
||||||
---
|
---
|
||||||
|
|
||||||
A little experiment on classifying toxic comments, and how I ended up from GBDT to Transformer-based model.
|
A little exploration and experiment on classifying toxic comments.
|
||||||
|
|
||||||
# Why
|
# Why
|
||||||
|
|
||||||
The motivation is simple; I just feel sad when they look suffered from toxic comments in live chats. The goal is also simple: design an automated system to spot toxic comments and quarantine them.
|
The motivation is simple; I just feel sad when they look suffered from toxic comments in live chats. The goal is also simple: design an automated system to spot toxic comments and destroy them.
|
||||||
|
|
||||||
# Data Data Data
|
# Data Data Data
|
||||||
|
|
||||||
> I can't make bricks without clay.
|
> I can't make bricks without clay.
|
||||||
|
> — Sherlock Holmes
|
||||||
|
|
||||||
I need a myriad of live chat comments and moderation events for analysis and future use.
|
I need a myriad of live chat comments and moderation events for analysis and future use.
|
||||||
|
|
||||||
@ -86,13 +87,13 @@ Here's a [t-SNE](https://en.wikipedia.org/wiki/T-distributed_stochastic_neighbor
|
|||||||
|
|
||||||
# Omake
|
# Omake
|
||||||
|
|
||||||
## Hololive-10M Dataset
|
## Hololive Dataset
|
||||||
|
|
||||||
I made collected chat events publicly available for those interested in further research.
|
I made collected chat events publicly available for those interested in further research.
|
||||||
|
|
||||||
The dataset contains:
|
The dataset contains:
|
||||||
|
|
||||||
- Chats - 10 million
|
- Chats
|
||||||
- Superchats (amount, currency)
|
- Superchats (amount, currency)
|
||||||
- Retraction events
|
- Retraction events
|
||||||
- Moderation events (ban, delete)
|
- Moderation events (ban, delete)
|
||||||
|
Before Width: | Height: | Size: 9.2 KiB After Width: | Height: | Size: 9.2 KiB |
Before Width: | Height: | Size: 1.3 KiB After Width: | Height: | Size: 1.3 KiB |
Before Width: | Height: | Size: 2.0 KiB After Width: | Height: | Size: 2.0 KiB |
Before Width: | Height: | Size: 131 KiB After Width: | Height: | Size: 131 KiB |
Before Width: | Height: | Size: 1.2 KiB After Width: | Height: | Size: 1.2 KiB |
Before Width: | Height: | Size: 1.2 KiB After Width: | Height: | Size: 1.2 KiB |