package mecab

  1. Overview
  2. Docs
An OCaml binding of MeCab, a part-of-speech and morphological analyzer.

Install

Dune Dependency

Authors

Maintainers

Sources

v0.0.0.tar.gz
sha256=479b271ea2dc05e019b121396593b610b067d67650961358e33d2199dcf51810
md5=e17ad800f8bd989a51feb0cafd16d903

Description

MeCab is a open-sourced part-of-speech and morphological analyzer independent from languages, dictionaries, and corpuses, developed by a joint project team of Graduate School of Informatics in Kyoto University and NTT Communication Science Laboratories.

Published: 14 Nov 2017

README

ocaml-mecab

An OCaml binding of MeCab, a part-of-speech and morphological analyzer.

MeCab is a open-sourced part-of-speech and morphological analyzer independent from languages, dictionaries, and corpuses, developed by a joint project team of Graduate School of Informatics in Kyoto University and NTT Communication Science Laboratories. MeCab is almost faster than ChaSen, Juman, and KAKASHI. The official documentation (written in Japanese) is published at MeCab: Yet Another Part-of-Speech and Morphological Analyzer, and English-translated version is at MeCab English Documentation.

Getting started

Install the latest-released version by

opam install mecab

or the latest snapshot by opam pin add mecab ..

You can parse a sentense as follows:

# #require "mecab" ;;

# let mecab = Mecab.Tagger.create [|""|];;
val mecab : Mecab.Tagger.t = <abstr>

# Mecab.Tagger.sparse_tostr mecab "すもももももももものうち" |> print_endline;;
すもも	名詞,一般,*,*,*,*,すもも,スモモ,スモモ
も	助詞,係助詞,*,*,*,*,も,モ,モ
もも	名詞,一般,*,*,*,*,もも,モモ,モモ
も	助詞,係助詞,*,*,*,*,も,モ,モ
もも	名詞,一般,*,*,*,*,もも,モモ,モモ
の	助詞,連体化,*,*,*,*,の,ノ,ノ
うち	名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ
EOS

- : unit = ()

Documentation

Dependencies (8)

  1. jbuilder >= "1.0+beta7"
  2. ocamlfind build & >= "1.5.0"
  3. ocaml-migrate-parsetree build & < "2.0.0"
  4. ppx_sexp_conv
  5. sexplib
  6. camomile < "2.0.0"
  7. conf-mecab >= "0.996"
  8. ocaml

Dev Dependencies

None

Used by

None

Conflicts

None

OCaml

Innovation. Community. Security.