A .NET wrapper for tesseract-ocr.
Since tesseract and leptonica binaries are compiled with Visual Studio 2012 you'll need to ensure you have the Visual Studio 2012 Runtime installed. This can be found here.
You will also need to download the language data files for tesseract 3.02 from tesseract-ocr.
- Add the
Tesseract
NuGet Package by runningInstall-Package Tesseract
from the Package Manager Console. - Ensure you have Visual Studio 2012 x86 & x64 runtimes installed (see note above).
- Download language data files for tesseract 3.02 from tesseract-ocr and add them to your project, ensure 'Copy to output directory' is set to Always.
- Check out the Samples solution
~/Samples/Tesseract.Samples.sln
for a working example
If you run into any issues please check out this wiki page which details a number common issues and some potential solutions.
Please create your pull requests to target the "develop" branch. "Master" is only for released code. Thank you.
Copyright 2012 Charles Weld.
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this software except in compliance with the License. You may obtain a copy of the License at:
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
- charlesw (Charles Weld)
A big thanks to GitHub and all of Tesseract's contributors:
Also thanks to the following projects\resources without which this project would not exist in it's current form:
- Reactive Extensions - The basic idea from which the build\packaging system is built on.
- TwainDotNet - Batch build script
- Tesseract-dot-net - The origianl dot net wrapper that started all this.
- Interop with Native Libraries - Stacks of useful information about c# P/Invoke and Marshaling