Anytime Large-Scale Analytics of Linked Open Data

Analytical queries are queries with numerical aggregators: computing the average number of objects per property, identifying the most frequent subjects, etc. Such queries are essential to monitor the quality and the content of the Linked Open Data (LOD) cloud. Many analytical queries cannot be executed directly on the SPARQL endpoints, because the fair use policy cuts off expensive queries. This paper shows how to rewrite such queries into a set of queries that each satisfy the fair use policy.

Publication

Anytime Large-Scale Analytics of Linked Open Data. Arnaud Soulet and Fabian M. Suchanek, Full paper at ISWC19 (research track).

Results of experiments

We provide the results of all our experiments in the results directory:

Validation on DBpedia:
- Quality of convergence: dbpedia_convergence.csv
- Top-k precision: dbpedia_top.csv
- Efficiency: dbpedia_efficiency.csv
Usage query on the LOD cloud: LOD_statistics.csv
Representativeness of LOD: LOD_representativeness.csv

Source code

We provide the Java source code of the prototype:

Usage query: This implementation measures property usage and class usage in the statistics directory.
Representativeness of LOD: This implementation computes for each property, a distribution over the frequency of the first significant digit of the number of objects per subject in the representativeness directory. We used the method proposed here to convert this distribution into a score between 0 and 1 that measures the "representativeness" of the triplestores.

Acknowledgements

This work was partially supported by the grant ANR-16-CE23-0007-01 (“DICOS”).

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
representativeness		representativeness
results		results
statistics		statistics
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Anytime Large-Scale Analytics of Linked Open Data

Publication

Results of experiments

Source code

Acknowledgements

About

Releases

Packages

Languages

License

asoulet/iswc19analytics

Folders and files

Latest commit

History

Repository files navigation

Anytime Large-Scale Analytics of Linked Open Data

Publication

Results of experiments

Source code

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages