DynamicPDF Core Suite brings tons of PDF creation and manipulation features into your Visual Studio application and includes full .NET Standard 2.0 support. Whether your need is to bring in an existing PDF document (for Form Filling, Form Flattening, Merging, Splitting, Adding New Pages, Stamping...
Tokenization of raw text is a standard pre-processing step for many NLP tasks. For English, tokenization usually involves punctuation splitting and separation of some affixes like possessives. Other languages require more extensive token pre-processing, which is usually called segmentation.
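As a rough illustration of the punctuation splitting and possessive separation described above, here is a minimal Python sketch. The `tokenize` function and its regular expression are hypothetical and do not come from any package listed here; real tokenizers also handle contractions, abbreviations, numbers, and many other cases.

```python
import re

# Hypothetical pattern for illustration only. Alternatives are tried in order:
#   's\b     -> a possessive suffix, split off as its own token
#   \w+      -> a run of word characters
#   [^\w\s]  -> any single punctuation character
_TOKEN_PATTERN = re.compile(r"'s\b|\w+|[^\w\s]")


def tokenize(text):
    """Split English text into words, possessive suffixes, and punctuation."""
    return _TOKEN_PATTERN.findall(text)


if __name__ == "__main__":
    print(tokenize("John's dog barked, loudly."))
```

This sketch treats every other apostrophe as plain punctuation, so a contraction such as "don't" is split into three pieces; a production tokenizer would need explicit rules for such cases.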
Innovoft.IO.SplitReaders is a library for efficient CSV column reading. When reading small CSV files on one thread, using a TextReader and calling string.Split(...) is not an issue. But when reading 100 GB CSV files on 48 threads, string creation ends up being an issue because of all the GC that...