Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

entropy again (case of multilinks) #45

Open
holtzermann17 opened this issue May 24, 2013 · 4 comments
Open

entropy again (case of multilinks) #45

holtzermann17 opened this issue May 24, 2013 · 4 comments
Assignees
Milestone

Comments

@holtzermann17
Copy link
Collaborator

In #29 we talked some about entropy of words (interesting reading)!

I noticed one potential simply application. This article contains 5 links for the term "equivalent":

(AFAICT, none of them are correct.)

I think that when we have multi-links like this, it's a pretty clear case of a "diluted" meaning. This is very different from the idea of entropy I was promoting in #29, where I was thinking about words like "group". The solution proposed in #44 might work better for "group".

But I feel like getting 5 links back is clear evidence that the system is overlinking (overthinking?). Can we run some tests (perhaps using the build system)?

For now I think the goal would be only be one of exploration and descriptive statistics.

@dginev
Copy link
Owner

dginev commented May 28, 2013

I would say the equivalencerelation link is appropriate.

As to the overlinking, I have no clue about the MSC classes - if they were in the same MSC class as some of the other links on the page (which seem ok), then there is no way for my current algorithm to figure out they don't apply.

Each of the non-generic links for equivalence should only be used in their very specific domain (e.g. forcing notions), which they currently aren't.

I think I have an idea how to prevent very specific concepts appearing in other classes, by introducing a handicap for specificity (in a way adding some of the old MSC steering logic back in). If a concept is defined within a very concrete MSC class, it should add a penalty for being added to the group of concepts in the top-level MSC classes being compared. Hm...

@ghost ghost assigned dginev Jun 21, 2013
@dginev
Copy link
Owner

dginev commented Jul 11, 2013

Moving to the 3.0 release milestone.

@holtzermann17
Copy link
Collaborator Author

Here's another example that in my view pushes the limit a bit too far:
http://planetmath.org/proofthatgrouphomomorphismspreserveidentity#comment-19582
Eight outbound links for homomorphism.

This is a comment, so in order to get it to link correctly, we'd have to find a way (on the Planetary side) to send in MSC information about the parent article.

@dginev
Copy link
Owner

dginev commented Nov 3, 2013

Haha, that's remarkable :) Maybe I should take your report as a sign to get back to improving NNexus (especially with December rapidly approaching).

As to sending in the MSC information, my implementation currently can't use any MSC info on input (I think), but maybe that is something we can enhance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants