Seitenhistorie
Eine kurze Liste.
- Mecab (C++) https://github.com/taku910/mecab
- JUMAN http://mecab.sourceforge.net/ /nlp.ist.i.kyoto-u.ac.jp/index.php?JUMAN
- JUMAN++ http://code.google.com/p/mecab/nlp.ist.i.kyoto-u.ac.jp/index.php?JUMAN++
- Chasen http://chasen-legacy.sourceforge.jp/
- Cabocha httphttps://chasengithub.orgcom/~taku/softwaretaku910/cabocha/
Japanese Dependency Analysis using Cascaded Chunking - Sen/GoSen (Java) http://sourceforge.net/projects/itadaki/
- cmecab-java (JNI-binding für MeCab) http://code.google.com/p/cmecab-java/
- kuromoji (Java) https://github.com/atilika/kuromoji | http://www.atilika.org/
- Sudachi (Java) https://github.com/WorksApplications/Sudachi
- igo (Java) http://igo.sourceforge.jp/
- Rakuten MA (JavaScript) https://github.com/rakuten-nlp/rakutenma
- KyTea http://www.phontron.com/kytea/
Verfügbare Corpora
- IPADIC (nicht mehr gepflegt) http://sourceforge.jp/projects/ipadic/
- NAIST Japanese Dictionary (IPADIC Nachfolger) http://sourceforge.jp/projects/naist-jdic/
- Unidic httpshttp://wwwunidic.ninjal.tokuteicorpusac.jp/ | http:/dist/index.phpsourceforge.jp/projects/unidic/
- UniDic 近代文語 (basiert auf Unidic) http://wwwwww2.kokkenninjal.goac.jp/lrc/index.php?UniDic%2F%B6%E1%C2%E5%CA%B8%B8%ECUniDic
Überblick
Inhalte