# TODO for the Python knowpro port
## Big functionality
- Serialization and deserialization (do this before queries since fully re-indexing takes too long)
- Queries and searches (so we can finally test everyhing)
- Retries for embeddings (for robustness -- TypeChat already retries)
## Small functionality
- Implement merge_action_knowledge() in convknowledge.py
- Implement sharing an embedding model between vectorbases (requires model and emb.size in TextEmbeddingSettings)
- Move the embeddings cache into the model so it can be shared between different vector bases
- Implement Podcast._build_caches(), by implementing create_term_embedding_cache()
## Refactoring implementations
- Change some inconsistent module names
- Rewrite podcast parsing without regexes
## Main/demo/test program support
- Unify various dotenv calls and make them search harder (relative to repo)
## Type checking stuff
- fuzzy_index type mismatch (could VectorBase be made to match the type?)
- Fix need for `# type: ignore` comments (usually need to make some I-interface generic in actual message/index/etc.)
## Low priority
- Various remove...() or delete...() methods (so far unused in TS knowPro)microsoft/TypeAgent
Publicmirrored from https://github.com/microsoft/TypeAgentAvailable
python/ta/TODO.md
32lines · modepreview