TextMining-ScrapCleanSummarize

This repository contains code for scraping, cleaning, and summarizing text data from websites. It provides a comprehensive process for extracting valuable information from online sources and condensing it into summarized text.

Installation

To use this code, follow these steps:

Clone this repository:

git clone https://github.com/chaymabh/TextMining-ScrapCleanSummarize.git
cd TextMining-ScrapCleanSummarize

Install the required dependencies:
```
pip install -r requirements.txt
```

Usage

Web Scraping: Use Data_collecter_and_cleaner.py to perform web scraping. This code leverages Beautiful Soup for automated data extraction.
Text Summarization: Utilize Text_summarizer.py to generate concise summaries of text. It uses the NLTK library for this purpose.
Customization: This code is designed for web scraping and text summarization but can be customized to suit your specific needs. Feel free to modify it for different websites and data sources.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Data_collecter_and_cleaner.py		Data_collecter_and_cleaner.py
README.md		README.md
Text_summarizer.py		Text_summarizer.py
Visualization.py		Visualization.py
events_cleaned.csv		events_cleaned.csv
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TextMining-ScrapCleanSummarize

Table of Contents

Installation

Usage

License

About

Releases

Packages

Languages

chaymabh/TextMining-ScrapCleanSummarize

Folders and files

Latest commit

History

Repository files navigation

TextMining-ScrapCleanSummarize

Table of Contents

Installation

Usage

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages