Commit f271a87
[RELEASE] iText 7 pdfOcr - 1.0.0
https://git.itextsupport.com/
* release/1.0.0:
[RELEASE] 1.0.0-SNAPSHOT -> 1.0.0
Hide possibility to set userWords
Refactor MultiThreadingTest test to reuse code from IntegrationTestHelper
Make the scope of a method stricter
A couple of small fixes to remove workarounds from code
Allow tesseract4 events from com.itextpdf.pdfocr space
Implement pdfOcr licensing
Create jar with sources in Maven
User words file is unexpectedly removed from disk
Increase test timeouts
Deploy jar with sources to Artifactory as the add-on is open source
Make pdfOcr classes autoportable
Remove licensekey version property from root pom file
Fix issue with saving processed images
Performance drop on some complex halftone images
Improve artifact descriptions
Improve Javadocs for Tesseract implementations
Small fix to avoid inner class in .NET
Change in Jenkinsfile to abort possible already running automatic builds
Change in Jenkinsfile so that the automatic build is only blocked when the build for itextcore for Java is running
Hide AbstractTesseract4OcrEngine#doTesseractOcr(File , List<File>, OutputFormat, int)
AbstractIntegrationTest#testSimpleTextOutput is triggered 13 times PDFOC-89
Add copyright headers
Add license information
Fix several Javadoc and code remarks PDFOC-84
Throw proper exceptions in case the Tesseract prerequisites have possibly not been met
Add FontProvider mechanism PDFOC-73
Update .mailmap
Improve test coverage
Add ActualText if there are NotDef glyphs
Introduce an option not to add layers to output PDF file PDFOC-74
Move NOTICE.txt to another directory
Update log message
Update comments
Update command structure for executable
Fix remarks related to TesseractOcrUtil class and add check for NOTDEF glyphs
Update target branch for sonar
Fix various code remarks
Fix various code and API design remarks
Split to two modules
Change name of root artifact
Remove clirr-maven-plugin
Improve test coverage
Split to two modules
Fix for SonarQube analysis
PDFOC-65 Fix various code remarks in test code
PDFOC-65 Fix various code remarks in test code
Set user_defined_dpi
Fix various code and API design remarks
Fix various code and API design remarks
On Linux the VM crashes at times to build the Java version of pdfOCR
Remove vulnerable dependency
On Linux the VM crashes at times to build the Java version of pdfOCR
Fix various code remarks
Build only on windows until PDFOC-68 is fixed
Add category to tests
Refactor test to junit ExpectedException
Add test for invalid font
Fix javadoc issues
Fix code style for enums
Rename test files
Refactor ocr images method and remove ImgFormat enum
Add license info for fonts
Refactor exceptions and log messages
Remove unused method
Add tests for log messages
Remove unused test files
Remove commented code
Change Jsoup to styled-xml-parser and fix according to review
Refactoring for porting to .net
Update test files
Fix for user words
Update dependencies
Update tests
Fix text positioning
Add tests for PDFCOC-31,32,33,34
Refactor image preprocessing
Add tests for ppm images
Remove creating the sources jar from pom.xml
Remove creating the sources jar from the Jenkinsfile
Fix ocr for ppm images
Fix Jenkinsfile: mvn workspace repository for Windows machines
Check ppm on linux
Fix for eng language
Fix for tmp file in tmp directory
Add default language with adding user-words
Fix getting font path
Fix for embedded font in jar
Refactoring for porting to .net
Add custom user words
Fix wrong message in OCRException
Add tests for text files
Add tests for text file output
Add possibility to OCR to a file + refactoring for multipage tiffs
Small refactoring, add test for ppm images
Fix for PNM images
Fix for tif images
Performance improvements of Jenkins builds
Update text positioning PDFOC-18
Add gitattributes
Update default font
Add .gitignore
Add tests for path to hocr script
Update preprocessing
Fix tests
Make path to tess data mandatory
Add separator for tess data path
Remove createPdfA3u parameter
Update tests with transparent text
Update TextInfo to public
Change default text color to transparent
Add new font for tests
Add comments
Add greek test
Add missed test pdf
Update compare tool test
Replace few tests using compare tool
Fix for tiff images
Add preprocessing and fix tests
Add logging for exceptions
Move tests for lib
Add tesseract lib and tests
Update images coordinates calculation
Update tesseract dir
Add null check for imagedata
Update scale mode tests
Update default scale mode
Update exception handling
Add new test image
Add empty text test
Update tests and code style
Add basic exception handling and cosmetic refactoring
Add placeholder in case of corrupted images
Update tests with compare tool
Refactoring according to the checkstyle plugin checks
Add japicmp plugin
Clean up dependencies
Fix logging lib
Add tests for tiff
Update tests for new tess data files
Add tests for scripts
Update directories structure and add tests for languages
Update exception handling
Update temp filenames in tests
Add tests for pdfa3u
Add tests and update structure
Add extended tests using compare tool
first approach
185 files changed
Lines changed: 14404 additions & 82 deletions
File tree
- pdfocr-api
  - src
    - main
      - java/com/itextpdf/pdfocr
        - events
      - resources/com/itextpdf/pdfocr
        - fonts
    - test
      - java/com/itextpdf
        - metainfo
        - pdfocr
          - events
          - helpers
      - resources/com/itextpdf/pdfocr
        - events
        - fonts
        - images
        - profiles
- pdfocr-tesseract4
  - src
    - main/java/com/itextpdf/pdfocr/tesseract4
      - events
    - test
      - java/com/itextpdf
        - metainfo
        - pdfocr
          - events
            - multithreading
          - general
          - imageformats
          - pdfa3u
          - pdflayers
          - tessdata
          - tesseract4
      - resources/com/itextpdf/pdfocr
        - documents
        - events/multithreading
        - fonts
        - images
          - tèst
        - profiles
        - tessdata
          - script