JSONDR (JSON Doctor) 🩺

Welcome to JSONDR, your friendly tool to convert any webpage to JSON. Whether you need structured data for analysis or to collect links for your next web project, JSONDR has you covered.

The tool can be used directly using the prefix https://jsondr.com/ or you can download jsondr.py and use it in your code. It's essentially a BeatifulSoup wrapper, maybe switching to Playwright would be better for dynamic data. Check out (Jina's Reader tool)[https://jina.ai/reader/], which this project was inspired by (also that project is more stable, I just wanted JSON).

Overview

JSONDR allows you to effortlessly convert web pages to JSON data by simply adding a prefix to any URL. Inspired by Jina's reader, JSONDR was built specifically to handle web data in a structured format.

Using JSONDR Directly

To use JSONDR directly, follow these steps:

Copy any URL of your choice.
Add the prefix jsondr.com/ to the URL, like so: jsondr.com/[your-url].
Replace [your-url] with the actual website link, e.g., jsondr.com/untapped.vc.
Paste the updated link in your browser and see the data presented in JSON format.

This is an easy and efficient way to grab links, texts, and metadata from any static webpage.

Using jsondr.py

If you'd like to integrate JSONDR directly into your Python projects, you can utilize jsondr.py. Here’s how:

Clone or Download: Clone this repository or copy jsondr.py into your project.

Import and Use:

import jsondr

# Example usage
url = 'http://example.com'
extracted_data = jsondr.extract_content(url)
print(extracted_data)

Functions Overview:
- extract_content(url): Main function that returns all extracted data (text, links, forms, tables, and metadata) as JSON.
- extract_metadata(soup, base_url): Extracts the title, description, and other metadata of the page.
- extract_texts_and_links(soup, base_url, base_domain): Extracts all text elements and links from the page.
- extract_forms(soup, base_url): Extracts form elements on the page.
- extract_tables(soup): Extracts table data from the page.

Development and Contributions

We're open to contributions that improve the project! If you'd like to contribute or suggest features, follow these steps:

Fork the repository.
Create a new branch.
Make your changes and commit them.
Open a pull request with a clear description.

Thank you for your interest in JSONDR! Feel free to reach out with any questions or issues.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
templates		templates
LICENSE		LICENSE
README.md		README.md
jsondr.py		jsondr.py
main.py		main.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JSONDR (JSON Doctor) 🩺

Table of Contents

Overview

Using JSONDR Directly

Using jsondr.py

Development and Contributions

About

Releases

Packages

Languages

License

yoheinakajima/jsondr

Folders and files

Latest commit

History

Repository files navigation

JSONDR (JSON Doctor) 🩺

Table of Contents

Overview

Using JSONDR Directly

Using jsondr.py

Development and Contributions

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages