PDF OCR Tool

Overview

This tool is designed to perform Optical Character Recognition (OCR) on .pdf, .jpg, .jpeg, and .png files. It allows users to extract text from documents, making the content searchable and editable.

Features

PDF Text Extraction: Extract text content from files.
Searchable Content: Convert scanned documents or image-based files into searchable text.
File Conversion: Converts files to .txt for easy of editing or further processing.

Installation

Clone this repository:

git clone https://github.com/LogPRose/NoteConverter.git

Navigate to the project directory:
```
cd NoteConverter 
```
Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

Run the tool and provide the path to the PDF file you want to process:
```
python NoteConverter.py -i example.pdf 
```
Run the tool with the help flag to see all options:
```
python NoteConverter.py --help 
```

Examples

Extract text from a PDF file and save it as plain text:

python NoteConverter.py --sd /user/Logan/482slides -td /user/Logan/482Notes

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
Conversion.py		Conversion.py
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF OCR Tool

Overview

Features

Installation

Usage

Examples

About

Releases

Packages

Languages

LogPRose/NoteConverter

Folders and files

Latest commit

History

Repository files navigation

PDF OCR Tool

Overview

Features

Installation

Usage

Examples

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages