12 packages returned for Tags:"tokenizer"

The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text. It supports the most common NLP tasks, such as tokenization, sentence segmentation, part-of-speech tagging, named entity extraction, chunking, parsing, and coreference resolution. These tasks... More information
VBF Compilers Library for Scanners
VBF.Compilers.Scanners is a scanner builder. It contains a regular expression to DFA engine, can generate high performance scanners for unicode source text.
Natural language query parser and rule-based named entity recognizer.
NLQuery: natural language query parser recognizes entities in context of structured sources: tabular data (database, indexed data). Can be used for building natural language interface to SQL database or OLAP cube, implementing custom search engine.