-
-
Notifications
You must be signed in to change notification settings - Fork 380
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
stackoverflow exception when load pages has lots of nested <br> tags #415
Comments
Hello @MachineLearning666 , I'm not sure if that something we fixed in more recent version but I tried to load this page and everything seem to work as expected: var doc = new HtmlDocument();
doc.LoadHtml(File.ReadAllText(@"C:\Users\Jonathan\Desktop\br_issue.html"));
var html = doc.DocumentNode.InnerHtml; Could you try it with the latest version? Best Regards, Jon Performance Libraries Runtime Evaluation |
Hi @JonathanMagnan, Thanks for the quick response. I think you may use view-source:http://ca800.com/biz/d_1_1nrusru51nrkj.html to get the page's html content (~180Kb). I tried it work as expected in all old/new versions. What I tried the page is from a crawler. It maybe an old version of the page. page's size is about 800Kb. |
Thank you @MachineLearning666 , We can now successfully reproduce it. We will look at it. |
Hello @MachineLearning666 , The v1.11.28 has been released. Could you try it and let us know if the Best Regards, Jon |
Hi @JonathanMagnan , the issue is fully fixed. Thanks a lot. |
Awesome @MachineLearning666 ! Don't hesitate to contact us with any questions, issues or feedback. Best regards, Jon |
Description
There is a stackoverflow issue when load page has lots nested tags, i tried to set MaxDepthLevel=100, OptionMaxNestedChildNodes=50, OptionAutoCloseOnEnd=true, OptionFixNestedTags=true, but it didn't work
Exception thrown by the code htmlDoc.LoadHtml(html);
Exception
StackOverflow
The text was updated successfully, but these errors were encountered: