Feature implementation from commits c02ab09..54885b8 #3

Open
yashuatla wants to merge 15 commits into feature-base-3 from feature-head-3

@yashuatla
Owner

This PR contains changes from a range of commits from the original repository.

Commit Range: c02ab09..54885b8
Files Changed: 240 (204 programming files)
Programming Ratio: 85.0%

Commits included:

natoverse and others added 15 commits February 10, 2025 11:00
* Add cache to build_noun_graph

* Semver
* Add vector store id reference to embeddings config.

* added communities to links and maxvals

* Consistent naming

* Update entity_ids to include index_name

* added consistent logging messages to miq cli

* semversioner

---------

Co-authored-by: Derek Worthen <worthend.derek@gmail.com>
Co-authored-by: Nathan Evans <github@talkswithnumbers.com>
* add new options

* add metadata json into input document

* remove doc change

* add metadata column into text loader

* prepend_metadata

* run fix

* fix tests and patch

* fix test

* add warning for metadata tokens > config size

* fix typo and run fix

* fix test_integration

* fix test

* run check

* rename and fix chunking

* fix

* fix

* fix test verbs

* fix

* fix tests

* fix chunking

* fix index

* fix cosmos test

* fix vars

* fix after PR

* fix
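
The metadata commits above (metadata column, prepend_metadata, warning when metadata tokens exceed the configured size) could look roughly like the sketch below. This is not the PR's code: `count_tokens`, `chunk_with_metadata`, the whitespace tokenizer, and the fall-back of dropping the prefix after warning are all assumptions made for illustration.

```python
import warnings


def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words stand in for a real tokenizer.
    return len(text.split())


def chunk_with_metadata(text, metadata, chunk_size, prepend_metadata=True):
    """Split text into chunks of at most chunk_size tokens (hypothetical helper).

    When prepend_metadata is set, every chunk starts with the metadata lines,
    and the metadata tokens count against the chunk budget.
    """
    prefix = ""
    if prepend_metadata and metadata:
        prefix = "".join(f"{key}: {value}\n" for key, value in metadata.items())
    meta_tokens = count_tokens(prefix)
    if meta_tokens >= chunk_size:
        # Assumed behavior: warn and fall back to chunks without metadata.
        warnings.warn(
            f"metadata uses {meta_tokens} tokens, chunk size is {chunk_size}; "
            "metadata will not be prepended"
        )
        prefix, meta_tokens = "", 0
    budget = chunk_size - meta_tokens
    words = text.split()
    return [
        prefix + " ".join(words[i : i + budget])
        for i in range(0, len(words), budget)
    ]


chunks = chunk_with_metadata("a b c d e f g", {"title": "Report"}, chunk_size=5)
```

Counting the metadata against the budget keeps every emitted chunk within the configured size, which is presumably why a warning is needed when the metadata alone is too large.
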
* Properly export the NLP community reports prompt

* Semver

* Fix verb tests
* consolidate query api functions and remove code duplication

* refactor and remove more code duplication

* Add semversioner file

* fix basic search

* fix drift search and update base class function names

* update example notebooks
* Remove most iterrow usages

* Semver

* Ruff

* Pyright

* Format
* Rework update output structure

* Semver

* Fix unit test

* Update frequency in incremental

---------

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
* Add children to the community tables

* Replace NaN children with empty list

* Replace subcommunity logic with built-in parent/child fields

* Remove restore_community_hierarchy

* Add children and frequency to migration notebook

* Format

* Semver

* Add children to reports

* Update tests

---------

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
* Add workflow registration

* Add ability to mutate config by workflows

* Separate graph finalization

* Separate graph pruning

* Semver

* Update tests

* Update smoke tests

* Fix iterrows on create_graph

* Remove prune_graph from llm construction

* Update test data

* Remove prune_graph from smoke tests
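
As a rough illustration of "workflow registration" and "ability to mutate config by workflows", the sketch below keeps a name-to-function registry and passes one shared config object through each workflow. All names here (`PipelineConfig`, `register_workflow`, `run_pipeline`, the two sample workflows) are hypothetical and are not taken from the PR.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class PipelineConfig:
    # Hypothetical config: ordered workflow names plus a flag a workflow may set.
    workflows: List[str] = field(default_factory=list)
    graph_finalized: bool = False


WorkflowFn = Callable[[PipelineConfig], str]
_REGISTRY: Dict[str, WorkflowFn] = {}


def register_workflow(name: str, fn: WorkflowFn) -> None:
    """Make a workflow available under a name (later registrations override)."""
    _REGISTRY[name] = fn


def run_pipeline(config: PipelineConfig) -> List[str]:
    """Run the configured workflows in order, sharing one mutable config."""
    results = []
    # Iterate over a copy so a workflow can safely edit config.workflows.
    for name in list(config.workflows):
        if name not in _REGISTRY:
            raise ValueError(f"unknown workflow: {name}")
        results.append(_REGISTRY[name](config))
    return results


def finalize_graph(config: PipelineConfig) -> str:
    config.graph_finalized = True  # a workflow mutating the shared config
    return "finalized"


def prune_graph(config: PipelineConfig) -> str:
    # A later workflow can react to what an earlier one changed.
    return "pruned" if config.graph_finalized else "skipped"


register_workflow("finalize_graph", finalize_graph)
register_workflow("prune_graph", prune_graph)
results = run_pipeline(PipelineConfig(workflows=["finalize_graph", "prune_graph"]))
```

Separating finalization and pruning into their own registered steps, as the commit list suggests, means a pipeline can omit either one by leaving its name out of the config.
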
* Patch json mode on Community Reports

* Semversioner

* Wording oopsie
* Move flow functions to workflow

* Remove redundant workflow_name variable

* Semver
* Add callbacks to global search

* Add callbacks to local search

* Add streaming callbacks in local search CLI

* Add callbacks to basic search

* Add callbacks to DRIFT search

* Semver

* Return generators directly in API

* Guard callbacks
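
The callback commits above (callbacks on each search type, generators returned directly, guarded callbacks) suggest a pattern like the following minimal sketch. `QueryCallbacks`, `on_token`, and `stream_search` are hypothetical names, and the "search" is a stand-in that just echoes words; the real search API is not shown in this PR page.

```python
from typing import Iterator, List, Optional, Protocol


class QueryCallbacks(Protocol):
    # Hypothetical callback interface: notified for each streamed token.
    def on_token(self, token: str) -> None: ...


class TokenCollector:
    """Example callback that records every token it sees."""

    def __init__(self) -> None:
        self.tokens: List[str] = []

    def on_token(self, token: str) -> None:
        self.tokens.append(token)


def stream_search(
    query: str, callbacks: Optional[List[QueryCallbacks]] = None
) -> Iterator[str]:
    """Yield response tokens one at a time, notifying callbacks along the way."""
    # Guard: treat a missing callbacks argument as an empty list.
    active = callbacks or []
    for token in f"answer to {query}".split():  # stand-in for a model stream
        for callback in active:
            callback.on_token(token)
        yield token


collector = TokenCollector()
streamed = list(stream_search("graphs", [collector]))
```

Returning the generator directly lets a CLI print tokens as they arrive, while callbacks give other consumers the same stream without changing the return type.
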
…upling (microsoft#1724)

* Base structure

* Add fnllm providers and Mock LLM

* Remove fnllm coupling, introduce llm providers

* Ruff + Tests fix

* Spellcheck

* Semver

* Format

* Default MockChat params

* Fix more tests

* Fix embedding smoke test

* Fix embeddings smoke test

* Fix MockEmbeddingLLM

* Rename LLM to model. Package organization

* Fix prompt tuning

* Oops

* Oops II
* Refactor config defaults

- Implement type-safe, hierarchical dataclass for config
defaults instead of namespaced constants.
- Allow for instantiating config directly from defaults data structure.

* fix vector_store db_uri default

---------

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
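
The "Refactor config defaults" commit describes replacing namespaced constants with a type-safe, hierarchical dataclass and instantiating config from it. A minimal sketch of that idea follows; the class names, field names, and numeric values are illustrative assumptions and are not the project's actual defaults.

```python
from dataclasses import asdict, dataclass, field


@dataclass(frozen=True)
class ChunkDefaults:
    # Illustrative values only, not the project's real defaults.
    size: int = 1200
    overlap: int = 100


@dataclass(frozen=True)
class VectorStoreDefaults:
    type: str = "lancedb"
    db_uri: str = "output/lancedb"


@dataclass(frozen=True)
class Defaults:
    """One hierarchical, immutable tree instead of flat CHUNK_SIZE-style constants."""

    chunks: ChunkDefaults = field(default_factory=ChunkDefaults)
    vector_store: VectorStoreDefaults = field(default_factory=VectorStoreDefaults)


DEFAULTS = Defaults()


@dataclass
class ChunkingConfig:
    # Field defaults read from the tree, so a type checker sees int, not Any.
    size: int = DEFAULTS.chunks.size
    overlap: int = DEFAULTS.chunks.overlap


# Instantiate a config section directly from the defaults data structure.
config = ChunkingConfig(**asdict(DEFAULTS.chunks))
```

With attribute access such as `DEFAULTS.vector_store.db_uri`, a misspelled default name fails in the editor or type checker rather than at runtime.
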
7 participants