Skip to content

Commit

Permalink
Adding Scraping for HackerNews/BleepingComputer
Browse files Browse the repository at this point in the history
  • Loading branch information
kaiiyer committed Apr 24, 2024
1 parent 0fe1ffa commit 9bf076d
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -32,7 +32,7 @@ pip install -r requirements.txt
## Usage
```
## Run the following code and paste the DIFR Report URL you are interested in:
python dfir_report_collector.py
python report_collector.py
## if you konow a bit of python, you can turn this code to be part of localGPT.py
## Run the following code to Normalize, create chunks and upload to 'vault.txt'
Expand Down
2 changes: 1 addition & 1 deletion txt_normalizer.py
Original file line number Diff line number Diff line change
Expand Up @@ -11,7 +11,7 @@

def upload_txtfile():
# Normalize whitespace and clean up text
with open("XYZ.txt", "r", encoding="utf-8") as vault_file:
with open("XYZ.txt", "r", encoding="utf-8") as vault_file: #Change Filename Here
text = vault_file.read()
text = re.sub(r'\s+', ' ', text).strip()

Expand Down

0 comments on commit 9bf076d

Please sign in to comment.