Taxonomy & Ontology Expert
Gramian Consulting Group
- Ireland
- Permanent
- Full-time
Contract: 12-18 month contract or FTE

Note: This is not a job for an engineer (Data Scientist, AI/ML). This is a Data Analyst/Entry type of role.

About the Team

The Indexing Team is part of the ML organization, specializing in indexing: processing massive datasets to enable ranking, search, and more. We handle real-time event processing and asynchronous batch jobs from diverse sources.

We partner closely with backend and ML teams to integrate our Indexing Platform with ML services, ensuring frictionless index creation and top-tier customer experiences.

Our platform fuels high-impact features: enterprise search, user feed ranking, and content understanding, all at B2C scale (10M+ DAU).

Key Responsibilities
- Onboard new entities into the Ads team Knowledge Graph, ensuring alignment with defined schemas and ontologies
- Serve as the primary point of contact for Product, Engineering, and Analytics teams requesting entity additions or schema changes
- Prioritize and triage incoming requests based on business impact, urgency, and technical feasibility
- Perform data entry and labeling to support automated content understanding pipelines
- Tune and refine LLM prompts to improve content understanding automation
- Review and curate LLM-generated entities, approving high-confidence outputs, correcting near-misses, and rejecting hallucinations
- Execute rigorous QA and validation to ensure ontological consistency and factual accuracy
- Create Golden Sets via targeted labeling to assess model quality and support classifier and extraction model fine-tuning
- Investigate and resolve data integrity issues, including error correction, duplicate entity merging, and relationship conflict resolution
- Provide clear stakeholder feedback, status updates, and modeling rationale, and guide teams on effective Knowledge Graph querying

Requirements
- 3+ years of experience with Knowledge Graphs
- Strong understanding of graph concepts (nodes, edges, properties)
- Experience with taxonomy and ontology: categorizing data, managing hierarchies, and understanding semantic relationships between entities
- Proficiency in navigating complex datasets.
- Experience with SQL, SPARQL, or Cypher is a strong plus.
- Understanding of how Generative AI works, common failure modes (hallucinations), and the importance of ground-truth data in training.