PDF OCR Tool

Overview

This tool performs Optical Character Recognition (OCR) on .pdf, .jpg, .jpeg, and .png files. It extracts the text from these documents so that the content becomes searchable and editable.

Features

PDF Text Extraction: Extract text content from PDF and image files.
Searchable Content: Convert scanned documents or image-based files into searchable text.
File Conversion: Convert files to .txt for easy editing or further processing.
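
As a rough illustration of the File Conversion feature, the sketch below checks whether a file has one of the extensions listed in the Overview and works out the matching .txt name. It is not taken from NoteConverter.py: the names `SUPPORTED_EXTENSIONS`, `is_supported`, and `text_output_path` are hypothetical, and the actual tool may choose output names differently.

```python
from pathlib import Path

# Extensions the Overview lists as supported inputs.
SUPPORTED_EXTENSIONS = {".pdf", ".jpg", ".jpeg", ".png"}


def is_supported(path):
    """Return True if the file extension is one the Overview lists (case-insensitive)."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS


def text_output_path(path):
    """Return the same path with its extension swapped for .txt (hypothetical naming rule)."""
    return Path(path).with_suffix(".txt")
```

Under these assumptions, `lecture1.pdf` would map to `lecture1.txt`, while a file such as `notes.docx` would be skipped.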

Installation

Clone this repository:

git clone https://github.com/LogPRose/NoteConverter.git

Navigate to the project directory:
```
cd NoteConverter 
```
Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

Run the tool and provide the path to the PDF file you want to process:
```
python NoteConverter.py -i example.pdf 
```
Run the tool with the help flag to see all options:
```
python NoteConverter.py --help 
```

Examples

Extract text from the files in a source directory and save the results as plain text in a target directory:

python NoteConverter.py --sd /user/Logan/482slides -td /user/Logan/482Notes
