You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
**Wikidata Textifier** is an API that transforms Wikidata items into compact format for use in LLMs and GenAI applications. It resolves missing labels of properties and claim values by querying the Wikidata Action API, making it efficient and suitable for AI pipelines.
3
+
**Wikidata Textifier** is an API that transforms Wikidata entities into compact outputs for LLM and GenAI use cases.
4
+
It resolves missing labels for properties and claim values using the Wikidata Action API and caches labels to reduce repeated lookups.
4
5
5
-
🔗 Live API: [https://wd-textify.toolforge.org/](https://wd-textify.toolforge.org/)
6
+
Live API: [wd-textify.wmcloud.org](https://wd-textify.wmcloud.org/)\
7
+
API Docs: [wd-textify.wmcloud.org/docs](https://wd-textify.wmcloud.org/docs)
6
8
7
-
---
9
+
## Features
8
10
9
-
## Functionalities
11
+
- Textify Wikidata entities as `json`, `text`, or `triplet`.
12
+
- Resolve labels for linked entities and properties.
13
+
- Cache labels in MariaDB for faster repeated requests.
14
+
- Support multilingual output with fallback language support.
15
+
- Avoid SPARQL and use Wikidata Action API / EntityData endpoints.
10
16
11
-
-**Textifies** any Wikidata item into a readable or JSON format suitable for LLMs.
12
-
-**Resolves all labels**, including those missing when querying the Wikidata API.
13
-
-**Caches labels** for 90 days to boost performance and reduce API load.
14
-
-**Avoids SPARQL** and uses the Wikidata Action API for better efficiency and compatibility.
15
-
-**Hosted on Toolforge**: [https://wd-textify.toolforge.org/](https://wd-textify.toolforge.org/)
17
+
## Output Formats
16
18
17
-
---
19
+
-`json`: Structured representation with claims (and optionally qualifiers/references).
20
+
-`text`: Readable summary including label, description, aliases, and attributes.
21
+
-`triplet`: Triplet-style lines with labels and IDs for graph-style traversal.
18
22
19
-
## Formats
20
-
21
-
-**Text**: A textual representation or summary of the Wikidata item, including its label, description, aliases, and claims. Useful for helping LLMs understand what the item represents.
22
-
-**Triplet**: Outputs each triplet as a structured line, including labels and IDs, but omits descriptions and aliases. Ideal for agentic LLMs to traverse and explore Wikidata.
23
-
-**JSON**: A structured and compact representation of the full item, suitable for custom formats.
0 commit comments