Skip to content

Commit

Permalink
[Scotland] Spot dangling paragraph before timestamp.
Browse files Browse the repository at this point in the history
  • Loading branch information
TheyWorkForYou Live CVS User committed May 3, 2024
1 parent 5dccfc3 commit c5409de
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion pyscraper/sp_2024/parse.py
Original file line number Diff line number Diff line change
Expand Up @@ -76,7 +76,7 @@ def process_raw_html(raw_html: Tag, agenda_item_url: str) -> BeautifulSoup:
# Deal with timestamps that are not inside anything first
raw_html = str(raw_html)
raw_html = re.sub(
"(?m)^\s*(.*?)\s*<br/>\s*<br/>", r"<timestamp>\1</timestamp>", raw_html
'(?:<p class="or-contribution-box">)?\s*(.*?)\s*<br/>\s*<br/>', r"<timestamp>\1</timestamp>", raw_html
)
soup = BeautifulSoup(raw_html, "html.parser")

Expand Down

0 comments on commit c5409de

Please sign in to comment.