OVERVIEW

COREKG is a coreset-based framework for personalized knowledge graph summarization. It uses sensitivity-based importance sampling to create compact, user-centric knowledge graph (KG) summaries that preserve query-relevant information with provable guarantees.
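To make the idea of sensitivity-based importance sampling concrete, here is a minimal sketch. It is not the COREKG implementation: the function name `sample_coreset`, the toy triples, and the sensitivity values are all hypothetical, and how COREKG actually derives sensitivities from a user's queries is not shown. The sketch only illustrates the generic technique: draw triples with probability proportional to their sensitivity and attach the weight 1 / (m * p_i) to each draw, so that weighted sums over the sample are unbiased estimates of sums over the full graph.

```python
import random


def sample_coreset(triples, sensitivities, m, seed=0):
    """Draw m triples (with replacement) with probability proportional
    to their sensitivity, and return (triple, weight) pairs.

    Assumes every sensitivity is strictly positive.
    """
    total = sum(sensitivities)
    probs = [s / total for s in sensitivities]
    rng = random.Random(seed)
    picks = rng.choices(range(len(triples)), weights=probs, k=m)
    # Weight 1 / (m * p_i) is the standard importance-sampling correction.
    return [(triples[i], 1.0 / (m * probs[i])) for i in picks]


# Toy graph; the sensitivity scores are made up for illustration.
triples = [
    ("alice", "worksFor", "acme"),
    ("bob", "worksFor", "acme"),
    ("acme", "founder", "carol"),
    ("dave", "likes", "tea"),
]
sensitivities = [0.4, 0.3, 0.25, 0.05]

coreset = sample_coreset(triples, sensitivities, m=3)
for triple, weight in coreset:
    print(triple, round(weight, 3))
```

Triples that matter more to the user's queries receive higher sensitivity and are therefore more likely to appear in the summary, while rarely sampled triples carry larger weights when they do appear.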

SPARQL Query

SPARQL is a structured query language for retrieving and manipulating data stored in Resource Description Framework (RDF) format within knowledge graphs.

Example query:

SELECT ?person ?organization WHERE {
  ?person <http://schema.org/worksFor> ?organization .
  ?organization <http://schema.org/founder> ?founder .
}
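The example query joins two triple patterns on the shared variable ?organization. The following sketch spells out that join in plain Python over a tiny, made-up list of triples (the `ex:` identifiers and the `answer` helper are hypothetical). In practice a SPARQL engine such as rdflib or Fuseki would evaluate the query; this is only meant to show what the query asks for.

```python
WORKS_FOR = "http://schema.org/worksFor"
FOUNDER = "http://schema.org/founder"

# Toy data: only ex:acme has a founder recorded.
triples = [
    ("ex:alice", WORKS_FOR, "ex:acme"),
    ("ex:bob", WORKS_FOR, "ex:initech"),
    ("ex:acme", FOUNDER, "ex:carol"),
]


def answer(triples):
    """Return one (person, organization) row per solution of the two patterns."""
    rows = []
    for person, p1, org in triples:
        if p1 != WORKS_FOR:
            continue  # first pattern: ?person worksFor ?organization
        for subj, p2, _founder in triples:
            # second pattern: ?organization founder ?founder (joined on org)
            if p2 == FOUNDER and subj == org:
                rows.append((person, org))
    return rows


print(answer(triples))  # [('ex:alice', 'ex:acme')]
```

ex:bob is not returned because ex:initech has no founder triple, so the second pattern has no match for that organization.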

DEPENDENCIES AND INSTALLATION

Required Python libraries:

numpy

tqdm

rdflib

SPARQLWrapper

Install using pip:

pip install numpy tqdm rdflib SPARQLWrapper

EVALUATION METRICS

F1 Score – Harmonic mean of precision and recall of the answers returned on the summary, measured against the answers returned on the full knowledge graph.

Coverage – Proportion of query-relevant triples retained.
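As a rough illustration of the two metrics, the sketch below computes a set-based F1 score and a coverage ratio. The function names, the handling of empty sets, and the toy inputs are assumptions made for this example; the exact definitions used in the COREKG evaluation may differ.

```python
def f1_score(summary_answers, full_answers):
    """F1 of answers from the summary against answers from the full KG."""
    summary, full = set(summary_answers), set(full_answers)
    if not summary and not full:
        return 1.0  # assumption: two empty answer sets agree perfectly
    true_positives = len(summary & full)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(summary)
    recall = true_positives / len(full)
    return 2 * precision * recall / (precision + recall)


def coverage(summary_triples, relevant_triples):
    """Fraction of query-relevant triples that the summary retains."""
    relevant = set(relevant_triples)
    if not relevant:
        return 1.0  # assumption: nothing to cover counts as full coverage
    return len(relevant & set(summary_triples)) / len(relevant)


# Toy example: 2 of 3 summary answers are correct, 2 of 4 true answers found.
print(f1_score({"a", "b", "e"}, {"a", "b", "c", "d"}))

relevant = [("s1", "p", "o1"), ("s2", "p", "o2"), ("s3", "p", "o3"), ("s4", "p", "o4")]
summary = relevant[:3] + [("s9", "q", "o9")]
print(coverage(summary, relevant))  # 0.75
```

In the first call precision is 2/3 and recall is 1/2, so the F1 score is 4/7.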

DATASETS AND QUERY WORKLOADS

Wikidata Dataset – https://dumps.wikimedia.org/wikidatawiki/entities/

Wikidata Query Dataset (LC-QuAD 2.0) – https://huggingface.co/datasets/mohnish/lc_quad/blob/main/data.zip

DBpedia Dataset – https://databus.dbpedia.org/dbpedia/collections/latest-core

DBpedia Query Dataset (LSQ) – http://lsq.aksw.org/

LSQ-Clean Toolkit – https://github.com/sparqeology/lsq-clean

Freebase Dataset – https://developers.google.com/freebase

Freebase Query Dataset (WebQSP) – https://aclanthology.org/P16-2033.pdf

SOFTWARE AND TOOLS USED

Apache Jena Fuseki – Used as a SPARQL endpoint for query execution. Website: https://jena.apache.org/documentation/fuseki2/

LSQ-Clean Toolkit – For preprocessing SPARQL logs and creating cleaned query workloads. Website: https://github.com/sparqeology/lsq-clean
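For orientation, this is a minimal sketch of how a query could be sent to a local Fuseki endpoint over the standard SPARQL protocol, using only the Python standard library. The dataset name `corekg` in the URL and the helper `build_sparql_request` are hypothetical; 3030 is Fuseki's default port, but your setup may differ. The SPARQLWrapper library listed in the dependencies offers a higher-level client for the same task.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_sparql_request(endpoint, query):
    """Build a GET request that passes the query in the `query` parameter
    and asks for results in the SPARQL JSON results format."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


# Hypothetical endpoint: a Fuseki dataset named "corekg" on the default port.
ENDPOINT = "http://localhost:3030/corekg/sparql"

req = build_sparql_request(ENDPOINT, "SELECT * WHERE { ?s ?p ?o } LIMIT 5")
print(req.full_url)

# With a running server, the request could be executed like this:
#   import json
#   from urllib.request import urlopen
#   with urlopen(req) as resp:
#       bindings = json.load(resp)["results"]["bindings"]
```

The network call is left commented out so the snippet runs without a server.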

