#38 list continuations #126

xyz65535 · 2024-09-05T17:38:20Z

#38 list continuations

- ListItem: added support for attaching blocks to a list item (paragraph, block, admonition).
- List: added tests covering various combinations of the above, e.g. nested lists, attached paragraph, attached admonition, two attached paragraphs, nested list with attached paragraph, nested list with attached multiline paragraph, attached paragraph and nested list.

Important improvement: code in Coradoc::Parser::Asciidoc::Base now converts grammar rules written as Ruby methods into proper parslet rules, which, as it turned out, they were not previously. It also goes beyond what parslet offers by supporting rules that take arguments. With this change, the time to run all the tests dropped from about 50 seconds to about 10 seconds (5x speedup), and the time to run utils/round_trip.rb, which processes realistic input, dropped from about 2.5 minutes to about 3 seconds (50x speedup). Both figures are approximate and were measured on the hardware used for testing. This makes it possible to iterate on grammar rules much faster, to develop inline parsing, and potentially to parse input from other sources, for example test cases from asciidoc, in a much more reasonable time.

Coradoc::Parser::Asciidoc::Base was changed from a module to a class; the rules it previously contained were moved to the module Coradoc::Parser::Asciidoc::Text.
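The speedup described above comes from not rebuilding a rule every time its method is called. The following is a minimal plain-Ruby sketch of that general idea, caching the result of a rule method per argument list. It is not Coradoc's actual code and does not use the Parslet API: `RuleBase`, `memoize_rules`, `ToyGrammar`, and `list_marker` are hypothetical names, and a `Regexp` stands in for a real parser object.

```ruby
# Hypothetical sketch: illustrates caching rule methods that take
# arguments. Not Coradoc's implementation and not Parslet's API.
class RuleBase
  # Wrap each listed method so its result is built once per distinct
  # argument list and reused on every later call.
  def self.memoize_rules(*names)
    names.each do |name|
      original = instance_method(name)
      define_method(name) do |*args|
        @rule_cache ||= {}
        key = [name, args]
        @rule_cache.fetch(key) do
          @rule_cache[key] = original.bind(self).call(*args)
        end
      end
    end
  end
end

class ToyGrammar < RuleBase
  attr_reader :builds

  def initialize
    @builds = 0
  end

  # Stand-in for an expensive rule construction; a real grammar would
  # return a parser object here instead of a Regexp.
  def list_marker(depth)
    @builds += 1
    /\A\*{#{depth}} /
  end

  memoize_rules :list_marker
end

grammar = ToyGrammar.new
first  = grammar.list_marker(2) # built on first use
second = grammar.list_marker(2) # served from the cache
third  = grammar.list_marker(3) # different argument, built separately
```

In this sketch the cache key includes the arguments, so `list_marker(2)` and `list_marker(3)` are distinct cached rules, while repeated calls with the same argument return the same object.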

Other improvements:

- Blocks: added the missing block types listing and open, and added missing properties to the literal block (based on issues noticed with utils/round_trip.rb).
- Grammar rules intended for use inside other rules, e.g. line_start? and line_not_text?. Their purpose is to ensure that the rules using them are applied by the parser only where they should be.
- Tests for blocks, paragraphs, and tables, and for two parsing bugs that remain to be fixed.
- Minor fixes in tests.

Metanorma PR checklist

Breaking changes (list related PRs)
Documentation update required (create task for this)
External dependency introduced (documentation update need)
Gem with native library introduced

codecov · 2024-09-05T18:06:35Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.02%. Comparing base (5904502) to head (ec037ce).
Report is 2 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #126   +/-   ##
=======================================
  Coverage   97.02%   97.02%           
=======================================
  Files           3        3           
  Lines         168      168           
=======================================
  Hits          163      163           
  Misses          5        5


ronaldtse

Can this be merged or is it still WIP?


ronaldtse · 2024-11-06T04:12:25Z

spec/coradoc/parser/asciidoc/content_spec.rb

+      table = ast.first[:table]
+
+
+      obj = {:table=>


Why is this not a Ruby object tree? There should NOT be any hashes.

hmdne · 2024-11-06

Coradoc does:

Coradoc text -> AST -> Ruby object tree

These tests cover only the first step, and the AST is represented with Ruby hashes.
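To illustrate the two steps mentioned in the reply, here is a small sketch in which a hash-based AST is turned into an object tree. The node classes `Table` and `Row`, the keys `:title`, `:rows`, and `:cols`, and the `transform` helper are all hypothetical and chosen only for illustration. Coradoc's real AST shape and element classes are not shown in this thread.

```ruby
# Hypothetical node classes; Coradoc's real element classes differ.
Table = Struct.new(:title, :rows)
Row   = Struct.new(:cells)

# Step 1 output (assumed shape): an AST made only of hashes, arrays
# and strings, as a parser test would see it.
ast = [
  { table: { title: "Units",
             rows: [{ cols: ["m", "metre"] }, { cols: ["s", "second"] }] } }
]

# Step 2: walk the hash AST and build an object tree from it.
def transform(node)
  case node
  when Array
    node.map { |child| transform(child) }
  when Hash
    if node.key?(:table)
      t = node[:table]
      Table.new(t[:title], transform(t[:rows]))
    elsif node.key?(:cols)
      Row.new(node[:cols])
    else
      node.transform_values { |value| transform(value) }
    end
  else
    node # leaf values such as strings pass through unchanged
  end
end

table_hash = ast.first[:table] # what a parser-level test inspects
doc = transform(ast)           # what a model-level test would inspect
```

A parser-level test asserts on `table_hash`, which is a plain hash, while a test of the second step would assert on `doc`, which contains `Table` and `Row` objects.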

ronaldtse · 2024-11-06T04:14:31Z

utils/round_trip.rb

@@ -30,7 +30,7 @@
    generated_adoc = Coradoc::Generator.gen_adoc(doc)
    cleaned_adoc = Coradoc::Input::HTML.cleaner.tidy(generated_adoc)
    File.open("#{file_path}.roundtrip","w"){|f| f.write(cleaned_adoc)}
-    `diff -B #{file_path} #{file_path}.roundtrip > #{file_path}.roundtrip.diff`
+    `diff -BNaur #{file_path} #{file_path}.roundtrip > #{file_path}.roundtrip.diff`


Would it be better to write a separate RSpec matcher that compares two Coradoc Document trees, in addition to the two string comparisons?

hmdne · 2024-11-13

It is just a utility at this point. It would make sense to add some round-trip tests, but at the moment we are not at 100% coverage.
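As a rough idea of what a tree comparison could look like, the sketch below walks two object trees and reports the paths at which they differ. It uses `Struct`-based stand-ins (`List`, `Item`) and a hypothetical helper `tree_diff`. Coradoc's real document classes are not Structs, so this only shows the comparison logic, which a custom RSpec matcher could wrap.

```ruby
# Hypothetical helper: returns a list of paths at which two trees differ.
# An empty list means the trees are structurally equal.
def tree_diff(a, b, path = "root")
  if a.is_a?(Struct) && b.is_a?(Struct) && a.class == b.class
    a.members.flat_map { |m| tree_diff(a[m], b[m], "#{path}.#{m}") }
  elsif a.is_a?(Array) && b.is_a?(Array)
    size = [a.length, b.length].max
    (0...size).flat_map { |i| tree_diff(a[i], b[i], "#{path}[#{i}]") }
  elsif a == b
    []
  else
    ["#{path}: #{a.inspect} != #{b.inspect}"]
  end
end

# Stand-in node classes, for illustration only.
Item = Struct.new(:marker, :text)
List = Struct.new(:items)

original  = List.new([Item.new("*", "one"), Item.new("*", "two")])
roundtrip = List.new([Item.new("*", "one"), Item.new("*", "two changed")])

diffs = tree_diff(original, roundtrip)
same  = tree_diff(original, original)
```

Compared with a textual `diff` of the generated AsciiDoc, a path-based report like this points at the node that changed rather than at the line of text where the change surfaced.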

ReesePlews · 2024-11-06T08:51:54Z

is this revision related to this issue #139 ?

webdev778 · 2024-11-06T08:55:39Z

> is this revision related to this issue #139 ?

#38

ReesePlews · 2024-11-06T09:10:56Z

thanks @webdev778. if you have time, #139 also describes lists that are being truncated, so it could be similar. please discuss with @ronaldtse

hmdne · 2024-11-06T18:58:54Z

@ReesePlews This PR is for AsciiDoc parsing. You probably mean HTML conversion to AsciiDoc. So this is unrelated.

xyz65535 force-pushed the list_continuations2 branch from d1f097d to ec037ce Compare September 5, 2024 17:44

webdev778 requested a review from ronaldtse September 5, 2024 21:22

ronaldtse approved these changes Oct 4, 2024


xyz65535 added 2 commits October 23, 2024 00:17

fixing bug for broken utils/round_trip.rb

41438d9

xyz65535 force-pushed the list_continuations2 branch from ec037ce to 76c1718 Compare November 5, 2024 23:33

xyz65535 changed the title ~~WIP #38 list continuations~~ #38 list continuations Nov 5, 2024

hmdne linked an issue Nov 5, 2024 that may be closed by this pull request

ListItem formatting needs to deal with HardBreaks #38

Open

ronaldtse requested changes Nov 6, 2024
