NNexus Future #54

dginev opened this issue Oct 5, 2019 · 0 comments

dginev commented Oct 5, 2019

I have cleaned the repository down to ten issues, in two projects.

I still consider adopting a better discovery method from the state of the art (#10) to be the most impactful work item.

Refreshing the concept indexes would also be nice, to keep them current (though tedious, since one needs to verify that the crawling code still works against the 2019 versions of the websites).

The big feature set I swept under the rug concerns "invalidation" (#16), which is platform-dependent and difficult to implement in the standalone repo.

Lastly, depending on who has time to invest in upgrading NNexus into a 2019 "best in class" tool, and when, I am also tempted to suggest a second rewrite, this time departing Perl for good. If I were the one undertaking it, I would certainly choose Rust, which is a great language for this type of tooling. I can point to the convenience of the statement classification showcase I recently completed, which is a decent exhibit of what a future NNexus web service may look like. That said, the port can be done in parts, with the index database generation remaining in Perl for a while. Web crawling is definitely one of the places where Perl code is just much quicker for getting things done.

If you're wondering about the current project's size, here is the report from the cloc tool:


github.com/AlDanial/cloc v 1.76  T=1.07 s (40.2 files/s, 13542.3 lines/s)
-------------------------------------------------------------------------------
Language                     files          blank        comment           code
-------------------------------------------------------------------------------
Perl                            27            807           1522           6492
HTML                             9            442            147           3754
make                             1            222            107            712
Markdown                         2             45              0             98
JSON                             1              0              0             56
YAML                             2              0              0             39
JavaScript                       1              1             16             32
-------------------------------------------------------------------------------
SUM:                            43           1517           1792          11183
-------------------------------------------------------------------------------

P.S. I should also say that I am not making rewrite suggestions lightly. I have ported all of my research work to Rust, and I credit it as one of the main reasons the tooling I ended up with is usable and productive. NNexus is not yet experiencing any explicit disadvantage from being in Perl, except maybe the much more limited community of potential developers.
