Hey @everyone! π
Weβre super excited to announce the release of text2typeql, a new open-source dataset designed to help LLMs master TypeQL! π§
π 14k Pairs: Natural language questions mapped to TypeQL 3.0.
π 15 Diverse Domains: From Social Networks and Financial Crime to Supply Chains and Game of Thrones.
β
Validated: Every query is validated against a strict TypeQL schema.
π€ Cypher Comparison: Includes side-by-side Cypher queries (derived from Neo4j Labs' benchmark) so you can compare patterns.
Not only for foundation LLM model creators to use! If you want to fine-tune a local model for TypeQL generation, this is the dataset for you!
π Read the Blog: https://typedb.com/blog/improving-llm-understanding-of-typeql-with-text2typeql
π» Get the Dataset (free & open source under Apache 2.0): https://github.com/typedb-osi/text2typeql
Joshua Β· 3w ago