microsoft/TypeAgent

Public

mirrored from https://github.com/microsoft/TypeAgentAvailable

CodeCommitsIssuesPull requestsActionsInsightsSecurity
6c58aed1ea5545286d0f78a55de849ed41e6949c

Branches

Tags

  • No tags available.
0Branches0Tags
Go to file
Add file
Code

Clone

HTTPS

Download ZIP

python/ta/TODO.md

32lines · modepreview

# TODO for the Python knowpro port

## Big functionality

- Serialization and deserialization (do this before queries since fully re-indexing takes too long)
- Queries and searches (so we can finally test everyhing)
- Retries for embeddings (for robustness -- TypeChat already retries)

## Small functionality

- Implement merge_action_knowledge() in convknowledge.py
- Implement sharing an embedding model between vectorbases (requires model and emb.size in TextEmbeddingSettings)
- Move the embeddings cache into the model so it can be shared between different vector bases
- Implement Podcast._build_caches(), by implementing create_term_embedding_cache()

## Refactoring implementations

- Change some inconsistent module names
- Rewrite podcast parsing without regexes

## Main/demo/test program support

- Unify various dotenv calls and make them search harder (relative to repo)

## Type checking stuff

- fuzzy_index type mismatch (could VectorBase be made to match the type?)
- Fix need for `# type: ignore` comments (usually need to make some I-interface generic in actual message/index/etc.)

## Low priority

- Various remove...() or delete...() methods (so far unused in TS knowPro)