Log in
Skip to sidebar
Skip to main content
Linked Applications
Loading…
Apache Software Foundation
Spaces
Hit enter to search
Help
Online Help
Keyboard Shortcuts
Feed Builder
What’s new
What’s new
Available Gadgets
About Confluence
Log in
TIKA
Pages
Page tree
Browse pages
Configure
Space tools
View Page
A
t
tachments (0)
Page History
Page Information
View in Hierarchy
View Source
Delete comments
Export to PDF
Export to Word
Copy Page Tree
Pages
…
Home
Parsers
TikaOCR
Page Information
Title:
TikaOCR
Author:
ASF Infrabot
Mar 26, 2019
Last Changed by:
Tim Allison
Oct 14, 2021
Tiny Link:
(useful for email)
https://cwiki.apache.org/confluence/x/ECOGBg
Export As:
Word
·
PDF
Incoming Links
TIKA (2)
Page:
Migrating to Tika 2.0.0
Home page:
Home
Hierarchy
Parent Page
Page:
Parsers
Labels
There are no labels assigned to this page.
Recent Changes
Time
Editor
Oct 14, 2021 14:36
Tim Allison
View Changes
Apr 08, 2021 12:20
Tim Allison
View Changes
Feb 10, 2021 15:58
Tim Allison
View Changes
Jan 13, 2021 19:25
Tim Allison
View Changes
Jan 05, 2021 19:53
Tim Allison
View Page History
Outgoing Links
External Links (14)
https://github.com/tesseract-ocr/tesseract/wiki/4.0-Accurac…
https://github.com/tesseract-ocr/tesseract/wiki
localhost:9998/tika
https://github.com/tesseract-ocr/tesseract/issues/263
localhost:9998/rmeta/text
https://tesseract-ocr.googlecode.com
https://wiki.apache.org/tika/PDFParser%20%28Apache%20PDFBox…
https://gist.github.com/henrik/1967035
https://issues.apache.org/jira/browse/TIKA-93
https://dl.fedoraproject.org/pub/epel/epel-release-latest-7…
https://github.com/tesseract-ocr/tesseract/issues/1171
https://github.com/UB-Mannheim/tesseract/wiki
https://imagemagick.org/script/download.php
https://imagemagick.org/download/binaries/ImageMagick-7.0.1…
Overview
Content Tools
Apps
{"serverDuration": 75, "requestCorrelationId": "d648a2f2ecd11177"}