URL Domain Counter

This Python script reads an Excel file, extracts domain names from URLs in the specified column (C column), and counts how many times each domain appears. The results are then saved to a new Excel file.

Prerequisites

Python 3.x
pandas
openpyxl

Installation

Install the required Python packages using pip:

pip install pandas openpyxl

Usage

Clone the repository:

git clone https://github.com/zackha/url-domain-counter-python.git
cd url-domain-counter-python

Run the script:

python url_domain_counter.py

Follow the prompts to select an input Excel file and specify an output file for the results.

How It Works

The script opens a file dialog for you to select an Excel file.
It reads URLs from the C column of the selected Excel file.
It extracts the domain names from the URLs and counts the occurrences of each domain.
It saves the results to a new Excel file, with domain names in the A column and their counts in the B column.

Example

Input Excel file (C column):

https://example.com/page1
https://example.com/page2
https://anotherdomain.com/page1
https://example.com/page3

Output Excel file:

Domain	Count
example.com	3
anotherdomain.com	1

License

This project is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
url_domain_counter.py		url_domain_counter.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

URL Domain Counter

Prerequisites

Installation

Usage

How It Works

Example

License

About

Releases

Packages

Languages

License

zackha/url-domain-counter-python

Folders and files

Latest commit

History

Repository files navigation

URL Domain Counter

Prerequisites

Installation

Usage

How It Works

Example

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages