TypeAgent · GitVita

# TODO for the Python knowpro port

## Big functionality

- Serialization and deserialization (do this before queries since fully re-indexing takes too long)
- Queries and searches (so we can finally test everyhing)
- Retries for embeddings (for robustness -- TypeChat already retries)

## Small functionality

- Implement merge_action_knowledge() in convknowledge.py
- Implement sharing an embedding model between vectorbases (requires model and emb.size in TextEmbeddingSettings)
- Move the embeddings cache into the model so it can be shared between different vector bases
- Implement Podcast._build_caches(), by implementing create_term_embedding_cache()

## Refactoring implementations

- Change some inconsistent module names
- Rewrite podcast parsing without regexes

## Main/demo/test program support

- Unify various dotenv calls and make them search harder (relative to repo)

## Type checking stuff

- fuzzy_index type mismatch (could VectorBase be made to match the type?)
- Fix need for `# type: ignore` comments (usually need to make some I-interface generic in actual message/index/etc.)

## Low priority

- Various remove...() or delete...() methods (so far unused in TS knowPro)
microsoft/TypeAgent

Branches

Tags

Clone

python/ta/TODO.md