Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

assess quality of Author disambiguation #3

Open
VladimirAlexiev opened this issue Aug 26, 2022 · 0 comments
Open

assess quality of Author disambiguation #3

VladimirAlexiev opened this issue Aug 26, 2022 · 0 comments

Comments

@VladimirAlexiev
Copy link

VladimirAlexiev commented Aug 26, 2022

I've read https://github.com/almugabo/openalex_qa/tree/main/scope, and you've made a good start here!

What do you think about Authors?

But 213M is still way too high, so I think a lot of author records are duplicated:

  • When I used MAG, they had 15 records of me until I "claimed" and merged them
  • VIAF had 22M personal records (clusters) in 2020-09: https://catalogo.pusc.it/beyond_viaf/
  • ORCID had 11.5M authors in 2021-05: https://www.wikidata.org/wiki/Property:P496
  • So I think the total number of researchers world-wide (dead and living) is maybe 15M, which would mean that OpenAlex authors have 14 duplicated records on average
@VladimirAlexiev VladimirAlexiev changed the title assess quality of Authors assess quality of Author disambiguation Aug 26, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant