Skip to content

Conversation

@dstengle
Copy link
Owner

Summary

  • Adapted entity processing to use the knowledgebase-vocabulary ontology
  • Updated entity recognition and RDF generation to align with the vocabulary specifications

Changes

  • Modified entity extraction to use vocabulary-defined entity types
  • Updated RDF triple generation to use proper predicates from knowledgebase-vocabulary
  • Ensured compatibility with the existing knowledge graph structure

Test Plan

  • Verify entity extraction works with test documents
  • Confirm RDF output uses correct vocabulary predicates
  • Test integration with Fuseki triple store
  • Validate that existing queries still function

🤖 Generated with Claude Code

Co-Authored-By: Claude noreply@anthropic.com

dstengle and others added 4 commits August 2, 2025 14:43
- Add KbPlaceholderDocument entity for wiki-style forward references
- Implement deterministic ID generation following ADR-0013 standards
- Update EntityService to use deterministic IDs instead of random UUIDs
- Add comprehensive ID normalization with Unicode NFKD and alphanumeric conversion
- Support for Person, Organization, Location, Project, Tag, and PlaceholderDocument entities
- Maintain RDF vocabulary compatibility with existing converter
- Enable wiki-based document linking with predictable entity identifiers

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
Signed-off-by: David Stenglein <dave@missingmass.com>
Signed-off-by: David Stenglein <dave@missingmass.com>
…uration

- Add local vocabulary cache at /vocabulary/ with kb.ttl file
- Create centralized vocabulary configuration module
- Add VERSION.json for tracking vocabulary metadata
- Implement sync script for updating from upstream repository
- Add comprehensive documentation for LLM agents
- Create ADR-0014 documenting the vocabulary reference strategy
- Update existing code to use centralized KB namespace import
- Add tests for vocabulary configuration and integration

This provides a deterministic, LLM-friendly approach to managing the
external vocabulary dependency while maintaining clear provenance.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
@dstengle dstengle merged commit b245e94 into main Aug 13, 2025
2 checks passed
@dstengle dstengle deleted the feature/vocabulary-adaptation branch August 13, 2025 18:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants