Skip to content

This repository provides an API for converting PDF documents into structured JSON data. The API is build using FastAPI web framework for building APIs with Python. The conversion process utilizes industry-standard libraries for PDF parsing and JSON serialization, ensuring accurate extraction of content from PDF files.

Notifications You must be signed in to change notification settings

justkr/pdf_to_structured_json

Repository files navigation

PDF to structured JSON

Description

This repository provides an API for converting PDF documents into structured JSON data. The API is build using FastAPI web framework for building APIs with Python. The conversion process utilizes industry-standard libraries for PDF parsing and JSON serialization, ensuring accurate extraction of content from PDF files.

Features

  • PDF to JSON conversion
  • File upload functionality
  • Displaying structured JSON output

Technologies Used

  • backend language - Python,
  • backend framework - FastAPI,
  • frontend - HTML, CSS, JavaScript,
  • Docker

Setup Instructions

  1. Clone the repository
git clone https://github.com/justkr/pdf_to_structured_json.git
  1. Build and run the Docker container
docker build -t pdf_reader_api .
docker run -p 8000:80 pdf_reader_api
  1. Access the API at
http://localhost:8000

Press CTRL+C to quit API

API Documentation

Introduction

This FastAPI-based API provides functionality for uploading PDF files and extracting structured text for them.

Endpoints

    • Method: 'POST'
    • Endpoint: '/'
    • Description: Upload a PDF file for text extraction

    • Method: 'GET'
    • Endpoint: '/'
    • Description: Display extracted text from PDF in structured JSON form

About

This repository provides an API for converting PDF documents into structured JSON data. The API is build using FastAPI web framework for building APIs with Python. The conversion process utilizes industry-standard libraries for PDF parsing and JSON serialization, ensuring accurate extraction of content from PDF files.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published