TikaOnDotNet 1.17.1

Bare-bones IKVM Java-to-.NET port of Apache Tika. You'll want to install TikaOnDotNet.TextExtractor.

Package Manager: Install-Package TikaOnDotNet -Version 1.17.1
.NET CLI: dotnet add package TikaOnDotNet --version 1.17.1
PackageReference (copy this XML node into the project file): <PackageReference Include="TikaOnDotNet" Version="1.17.1" />
Paket CLI: paket add TikaOnDotNet --version 1.17.1

Release Notes

- Add new overloads of `TextExtractor.Extract` that let users provide their own extraction result assemblers. Example:
```cs
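using System.Collections.Generic;
using System.Linq;
using FluentAssertions;
using NUnit.Framework;
using org.apache.tika.metadata;
using TikaOnDotNet.TextExtractor;

// Custom result type holding the extracted text and the raw metadata values.
public class CustomResult
{
    public string Text { get; set; }
    public IDictionary<string, string[]> Metadata { get; set; }
}

// The fixture name is illustrative; the original snippet showed only the members.
[TestFixture]
public class CustomResultTests
{
    // Result assembler passed to Extract: maps each metadata name to all of its values.
    public static CustomResult CreateCustomResult(string text, Metadata metadata)
    {
        var metaDataDictionary = metadata.names().ToDictionary(name => name, metadata.getValues);
        return new CustomResult
        {
            Metadata = metaDataDictionary,
            Text = text,
        };
    }

    [Test]
    public void should_extract_author_list_from_pdf()
    {
        var textExtractionResult = new TextExtractor().Extract("file_with_authors.pdf", CreateCustomResult);
        textExtractionResult.Metadata["meta:author"].Should().ContainInOrder("Fred Jones, M. D.", "Donald Evans D. M.");
    }
}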
```

Dependencies

- IKVM (>= 8.1.5717)

Version History

| Version | Downloads | Last updated |
|---|---|---|
| 1.17.1 | 35,380 | 4/3/2018 |
| 1.17.0 | 8,472 | 2/15/2018 |
| 1.16.0 | 13,140 | 7/30/2017 |
| 1.15.0 | 291 | 7/30/2017 |
| 1.14.2 | 7,971 | 4/22/2017 |
| 1.14.2-pre | 299 | 4/15/2017 |
| 1.14.1 | 66,014 | 1/13/2017 |
| 1.14.0 | 1,345 | 12/8/2016 |
| 1.13.1 | 2,877 | 8/16/2016 |
| 1.13.0 | 1,239 | 6/30/2016 |
| 1.12.2 | 6,003 | 4/12/2016 |
| 1.12.1 | 320 | 4/12/2016 |
| 1.12.0 | 604 | 4/11/2016 |
| 1.7.0 | 8,008 | 2/6/2015 |
| 1.6.4.51427 | 2,471 | 1/16/2015 |
| 1.6.3 | 3,307 | 9/27/2014 |
| 1.6.2.1 | 1,101 | 6/5/2014 |
| 1.6.0 | 442 | 6/5/2014 |
| 1.5.2 | 435 | 5/30/2014 |
| 1.5.0 | 865 | 3/5/2014 |
| 1.4.0.51459 | 1,699 | 7/12/2013 |