Do you need to file an issue?
Describe the bug
Running update_output twice throws:
ValueError: Could not find update_output/YYYYMMDD-HHMMSS/delta/communities.parquet in storage!
This issue is related to here.
key = self._keyname(key) becomes (in my case) 'output\update_output\YYYYMMDD-HHMMSS\delta\communities.parquet', i.e. a key with Windows backslashes, while the error message reports the path with forward slashes.
I guess this is because of logic left over from the previous version.
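To illustrate what I suspect is happening, here is a minimal sketch, assuming the key is built with OS-native path joining (backslashes on Windows) while the blob names in the container use forward slashes. The helpers `windows_style_key` and `blob_style_key` are hypothetical and are not GraphRAG's actual `_keyname`; `PureWindowsPath` is used only so the Windows behavior can be reproduced on any OS. The extra `output\` prefix seen in my case is not modeled here.

```python
from pathlib import PurePosixPath, PureWindowsPath


def windows_style_key(base_dir: str, key: str) -> str:
    """Hypothetical: join the key the way a native path join would on Windows."""
    return str(PureWindowsPath(base_dir) / key)


def blob_style_key(base_dir: str, key: str) -> str:
    """Hypothetical: always build a forward-slash key, whatever separators come in."""
    # PureWindowsPath accepts both "/" and "\" as separators, so as_posix()
    # normalizes either input style to forward slashes.
    normalized = PureWindowsPath(key).as_posix()
    return (PurePosixPath(base_dir) / normalized).as_posix()


base_dir = "update_output"
key = "YYYYMMDD-HHMMSS/delta/communities.parquet"

# Backslash-separated key: would not match a blob stored under a "/" name.
print(windows_style_key(base_dir, key))
# Forward-slash key: matches the name shown in the ValueError above.
print(blob_style_key(base_dir, key))
```

If the real cause is along these lines, normalizing the separators before the blob lookup should make the second update find the delta files. I have not verified this against the GraphRAG source.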
Steps to reproduce
1. init the graphrag root dir
2. create the index
3. add a file and update the index
4. add another file and update the index
Expected Behavior
The second update should complete normally, with both documents added in steps 3 and 4 present in the index.
GraphRAG Config Used
...
update_index_output:
  type: blob
  provider: azure
  storage_account_blob_url: ${BLOB_STORAGE_URL}
  container_name: ${ROOT_DIR}
  base_dir: "update_output"
...
Logs and screenshots
No response
Additional Information
- GraphRAG Version: 2.1.0
- Operating System: Windows 11
- Python Version: 3.14
- Related Issues: