Parse minister-clause. #17

Open
helgihg opened this issue Jan 21, 2024 · 0 comments

helgihg commented Jan 21, 2024

What we call a "minister-clause" is a block of metadata that currently appears near the top of each legal document.

Example:

<minister-clause>&lt;a href=&quot;https://www.althingi.is/thingstorf/thingmalalistar-eftir-thingum/ferill/?ltg=83&amp;amp;mnr=54&quot;&gt; &lt;i&gt; Ferill málsins á Alþingi. &lt;/i&gt; &lt;/a&gt; &lt;a href=&quot;https://www.althingi.is/altext/pdf/83/s/0054.pdf&quot;&gt; &lt;i&gt; Frumvarp til laga. &lt;/i&gt; &lt;/a&gt; &lt;br/&gt; &lt;br/&gt; &lt;small&gt; &lt;b&gt; Felld úr gildi skv. &lt;a href=&quot;https://www.althingi.is/altext/stjt/2020.100.html&quot;&gt; l. 100/2020 &lt;/a&gt; , 111. gr. &lt;/b&gt; &lt;/small&gt;</minister-clause>

It is escaped HTML that carries two pieces of information:

  1. A history of legal changes (not necessarily valuable since it's parsable from footnotes).
  2. More importantly: Information about which ministry and minister the law refers to.

This information should be parsed and retained in formal XML instead of only human-readable HTML.
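
As a starting point, here is a minimal sketch of what that parsing could look like, using only the Python standard library. It unescapes the HTML stored inside `<minister-clause>` and rewrites each anchor as a child element. The function name `parse_minister_clause` and the `<link>` element are hypothetical, not part of the existing schema, and extracting the ministry and minister is left out because the example above does not show how that information is formatted.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects (label, href) pairs for every <a> element in an HTML string."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.links = []
        self._href = None   # href of the anchor currently being read, if any
        self._text = []     # text fragments seen inside that anchor

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            # Collapse the whitespace padding found around the link text.
            label = " ".join("".join(self._text).split())
            self.links.append((label, self._href))
            self._href = None


def parse_minister_clause(xml_fragment):
    """Return a new <minister-clause> element with one <link> child per anchor.

    The <link> element name is an assumption made for illustration only.
    """
    element = ET.fromstring(xml_fragment)
    collector = LinkCollector()
    # The XML parser has already undone one level of escaping, so
    # element.text is plain HTML at this point.
    collector.feed(element.text or "")
    collector.close()

    structured = ET.Element("minister-clause")
    for label, href in collector.links:
        link = ET.SubElement(structured, "link", href=href)
        link.text = label
    return structured
```

Calling `ET.tostring(parse_minister_clause(fragment), encoding="unicode")` on a fragment like the example would give XML with one `<link>` per anchor. Note that the double-escaped `&amp;amp;` in the stored `href` values is decoded once by the XML parser and once by the HTML parser, so the resulting attribute holds a plain `&`.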
