openai/chatkit-python

Public

mirrored fromhttps://github.com/openai/chatkit-pythonAvailable

Watch0 Fork0 Star0

Code Commits Issues Pull requests Actions Insights Security

v1.5.1

Find a branch or tag

Branches

v1.5.1

Clone

HTTPS

Download ZIP

chatkit-python/docs/guides

docs/guides/accept-rich-user-input.md

304lines · modecode

Raw Download

Latest commit unavailable.

unknown

1	`# Accept rich user input`
2
3	`This guide explains how a ChatKit server accepts user input beyond plain text—such as attachments and @-mentions—and makes it available to your inference pipeline.`
4
5	`At a high level:`
6
7	`- Attachments let users upload files that your model can read.`
8	`- @-mentions let users tag entities so the model does not have to guess from free text.`
9
10	`## Attachments: let users upload files`
11
12	`Let users attach files/images by turning on client support, choosing an upload strategy, wiring the upload endpoints, and converting attachments to model inputs.`
13
14	`### Enable attachments in the client`
15
16	`Turn on attachments in the composer and configure client-side limits:`
17
18	```ts
19	`const chatkit = useChatKit({`
20	`// ...`
21	`composer: {`
22	`attachments: {`
23	`enabled: true,`
24	`// configure accepted MIME types, count, and size limits here`
25	`},`
26	`},`
27	`});`
28	```
29
30	Under the hood this maps to `ChatKitOptions.composer.attachments`; see the [`composer.attachments` docs](https://openai.github.io/chatkit-js/api/openai/chatkit/type-aliases/composeroption/#attachments) for all available options.
31
32	`### Configure an upload strategy`
33
34	Set [`ChatKitOptions.api.uploadStrategy`](https://openai.github.io/chatkit-js/api/openai/chatkit/type-aliases/fileuploadstrategy/) to:
35
36	- Direct: your backend exposes a single upload URL that accepts the bytes and writes attachment metadata to your `Store`. Simpler and faster when you control uploads directly from the app server.
37	- Two-phase: the client makes a ChatKit API request to create an attachment metadata record (which forwards the request to `AttachmentStore`), you return an `upload_url` as part of the created attachment metadata, and the client uploads bytes in a second step. Prefer this when you front object storage with presigned/temporary URLs or want to offload upload bandwidth (for example, to a third-party blob storage).
38
39	Both strategies still require an `AttachmentStore` for delete cleanup. Choose direct for simplicity on the same origin; choose two-phase for cloud storage and larger files.
40
41	`### Enforce attachment access control`
42
43	Neither attachment metadata nor file bytes are protected by ChatKit. Use the `context` passed into your `AttachmentStore` methods to authorize every create/read/delete. Only return IDs, bytes, or signed URLs when the caller owns the attachment, and prefer short-lived download URLs. Skipping these checks can leak customer data.
44
45	`### Direct upload`
46
47	Add the upload endpoint referenced in `uploadStrategy`. It must:
48
49	- accept `multipart/form-data` with a `file` field,
50	`- store the bytes wherever you like,`
51	- create `Attachment` metadata, persist it via `Store.save_attachment`, and
52	- return the `Attachment` JSON.
53
54	Implement `AttachmentStore.delete_attachment` to delete the stored bytes; `ChatKitServer` will then call `Store.delete_attachment` to drop metadata.
55
56	`Example client configuration:`
57
58	```js
59	`{`
60	`type: "direct",`
61	`uploadUrl: "/files",`
62	`}`
63	```
64
65	`Example FastAPI direct upload endpoint:`
66
67	```python
68	`@app.post("/files")`
69	`async def upload_file(request: Request):`
70	`form_data = await request.form()`
71	`file = form_data.get("file")`
72
73	`# Your blob store upload`
74	`attachment = await upload_to_blob_store(file)`
75
76	`return Response(content=attachment.model_dump_json(), media_type="application/json")`
77	```
78
79	`### Two-phase upload`
80
81	Implement `AttachmentStore.create_attachment` to:
82
83	- build an `upload_url` that accepts `multipart/form-data` with a `file` field (direct PUTs are currently not supported),
84	- build the `Attachment` model,
85	- persist it via `Store.save_attachment`, and
86	`- return it.`
87
88	Implement `AttachmentStore.delete_attachment` to delete the stored bytes; `ChatKitServer` will call `Store.delete_attachment` afterward.
89
90	- The client POSTs the bytes to `upload_url` after it receives the created attachment metadata in the response.
91
92	`Client configuration:`
93
94	```js
95	`{`
96	`type: "two_phase",`
97	`}`
98	```
99
100	`Example two-phase store issuing a multipart upload URL:`
101
102	```python
103	`attachment_store = BlobAttachmentStore()`
104	`server = MyChatKitServer(store=data_store, attachment_store=attachment_store)`
105
106
107	`class BlobAttachmentStore(AttachmentStore[RequestContext]):`
108	`def generate_attachment_id(self, mime_type: str, context: RequestContext) -> str:`
109	`return f\"att_{uuid4().hex}\"`
110
111	`async def create_attachment(`
112	`self, input: AttachmentCreateParams, context: RequestContext`
113	`) -> Attachment:`
114	`att_id = self.generate_attachment_id(input.mime_type, context)`
115	`upload_url = issue_multipart_upload_url(att_id, input.mime_type) # your blob store`
116	`attachment = Attachment(`
117	`id=att_id,`
118	`mime_type=input.mime_type,`
119	`name=input.name,`
120	`upload_url=upload_url,`
121	`)`
122	`await data_store.save_attachment(attachment, context=context)`
123	`return attachment`
124
125	`async def delete_attachment(self, attachment_id: str, context: RequestContext) -> None:`
126	`await delete_blob(att_id=attachment_id) # your blob store`
127	```
128
129	`### Convert attachments to model input`
130
131	Attachments arrive on `input_user_message.attachments` in `ChatKitServer.respond`. The default `ThreadItemConverter` does not handle them, so subclass and implement `attachment_to_message_content` to return a `ResponseInputContentParam` before calling `Runner.run_streamed`.
132
133	`Example using a blob fetch helper:`
134
135	```python
136	`from chatkit.agents import ThreadItemConverter`
137	`from chatkit.types import ImageAttachment`
138	`from openai.types.responses import ResponseInputFileParam, ResponseInputImageParam`
139
140
141	`async def read_bytes(attachment_id: str) -> bytes:`
142	`... # fetch from your blob store`
143
144
145	`def as_data_url(mime: str, content: bytes) -> str:`
146	`return "data:" + mime + ";base64," + base64.b64encode(content).decode("utf-8")`
147
148
149	`class MyConverter(ThreadItemConverter):`
150	`async def attachment_to_message_content(self, attachment):`
151	`content = await read_bytes(attachment.id)`
152	`if isinstance(attachment, ImageAttachment):`
153	`return ResponseInputImageParam(`
154	`type="input_image",`
155	`detail="auto",`
156	`image_url=as_data_url(attachment.mime_type, content),`
157	`)`
158	`if attachment.mime_type == "application/pdf":`
159	`return ResponseInputFileParam(`
160	`type="input_file",`
161	`file_data=as_data_url(attachment.mime_type, content),`
162	`filename=attachment.name or "unknown",`
163	`)`
164	`# For other text formats, check for API support first before`
165	`# sending as a ResponseInputFileParam.`
166	```
167
168	`### Show image attachment previews in thread`
169
170	Set `ImageAttachment.preview_url` to allow the client to render thumbnails.
171
172	- If your preview URLs are permanent/public, set `preview_url` once when creating the attachment and persist it.
173	- If your storage uses expiring URLs, generate a fresh `preview_url` when returning attachment metadata (for example, in `Store.load_thread_items` and `Store.load_attachment`) rather than persisting a long-lived URL. In this case, returning a short-lived signed URL directly is the simplest approach. Alternatively, you may return a redirect that resolves to a temporary signed URL, as long as the final URL serves image bytes with appropriate CORS headers.
174
175	`## @-mentions: tag entities in user messages`
176
177	`Enable @-mentions so users can tag entities (like documents, tickets, or users) instead of pasting raw identifiers. Mentions travel through ChatKit as structured tags so the model can resolve entities instead of guessing from free text.`
178
179	`### Enable as-you-type entity lookup in the composer`
180
181	To enable entity tagging as @-mentions in the composer, configure [`entities.onTagSearch`](https://openai.github.io/chatkit-js/api/openai/chatkit/type-aliases/entitiesoption/#ontagsearch) as a ChatKit.js option.
182
183	`It should return a list of [Entity](https://openai.github.io/chatkit-js/api/openai/chatkit/type-aliases/entity/) objects that match the query string.`
184
185	```ts
186	`const chatkit = useChatKit({`
187	`// ...`
188	`entities: {`
189	`onTagSearch: async (query: string) => {`
190	`return [`
191	`{`
192	`id: "article_123",`
193	`title: "The Future of AI",`
194	`group: "Trending",`
195	`icon: "globe",`
196	`data: { type: "article" }`
197	`},`
198	`{`
199	`id: "article_124",`
200	`title: "One weird trick to improve your sleep",`
201	`group: "Trending",`
202	`icon: "globe",`
203	`data: { type: "article" }`
204	`},`
205	`]`
206	`},`
207	`},`
208	`})`
209	```
210
211	`### Convert tags into model input in your server`
212
213	Mentions arrive server-side as structured tags. Override `ThreadItemConverter.tag_to_message_content` to describe what each tag refers to and translate it into model-readable content.
214
215	`Example converter method that wraps the tagged entity details in custom markup:`
216
217	```python
218	`from chatkit.agents import ThreadItemConverter`
219	`from chatkit.types import UserMessageTagContent`
220	`from openai.types.responses import ResponseInputTextParam`
221
222
223	`class MyThreadItemConverter(ThreadItemConverter):`
224	`async def tag_to_message_content(`
225	`self, tag: UserMessageTagContent`
226	`) -> ResponseInputTextParam:`
227	`if tag.type == "article":`
228	`# Load or unpack the entity the tag refers to`
229	`summary = await fetch_article_summary(tag.id)`
230	`return ResponseInputTextParam(`
231	`type="input_text",`
232	`text=(`
233	`"<ARTICLE_TAG>\n"`
234	`f"ID: {tag.id}\n"`
235	`f"Title: {tag.text}\n"`
236	`f"Summary: {summary}\n"`
237	`"</ARTICLE_TAG>"`
238	`),`
239	`)`
240	```
241
242	`### Pair mentions with retrieval tool calls`
243
244	When the referenced content is too large to inline, keep the tag lean (id + short summary) and let the model fetch details via a tool. In your system prompt, tell the assistant to call the retrieval tool when it sees an `ARTICLE_TAG`.
245
246	`Example tool paired with the converter above:`
247
248	```python
249	`from agents import Agent, StopAtTools, RunContextWrapper, function_tool`
250	`from chatkit.agents import AgentContext`
251
252
253	`@function_tool(description_override="Fetch full article content by id.")`
254	`async def fetch_article(ctx: RunContextWrapper[AgentContext], article_id: str):`
255	`article = await load_article_content(article_id)`
256	`return {`
257	`"title": article.title,`
258	`"content": article.body,`
259	`"url": article.url,`
260	`}`
261
262
263	`assistant = Agent[AgentContext](`
264	`...,`
265	`tools=[fetch_article],`
266	`)`
267	```
268
269	In `tag_to_message_content`, include the id the tool expects (for example, `tag.id` or `tag.data["article_id"]`). The model can then decide to call `fetch_article` to pull the full text instead of relying solely on the brief summary in the tag.
270
271	`### Prompt the model about mentions`
272
273	`Add short system guidance to help the assistant understand the input item that adds details about the @-mention.`
274
275	`For example:`
276
277	```
278	`- <ARTICLE_TAG>...</ARTICLE_TAG> is a summary of an article the user referenced.`
279	`- Use it as trusted context when answering questions about that article.`
280	`- Do not restate the summary verbatim; answer the user’s question concisely.`
281	- Call the `fetch_article` tool with the article id from the tag when more
282	`detail is needed or the user asks for specifics not in the summary.`
283	```
284
285	`Combined with the converter above, the model receives explicit, disambiguated entity context while users keep a rich mention UI.`
286
287	`### Handle clicks and previews`
288
289	Clicks and hover previews apply to the tagged entities shown in past user messages. Mark an entity as interactive when you return it from `onTagSearch` so the client knows to wire these callbacks:
290
291	```ts
292	`{`
293	`id: "article_123",`
294	`title: "The Future of AI",`
295	`group: "Trending",`
296	`icon: "globe",`
297	`interactive: true, // clickable/previewable`
298	`data: { type: "article" }`
299	`}`
300	```
301
302	- `entities.onClick` fires when a user clicks a tag in the transcript. Handle navigation or open a detail view. See the [onClick option](https://openai.github.io/chatkit-js/api/openai/chatkit/type-aliases/entitiesoption/#onclick).
303	- `entities.onRequestPreview` runs when the user hovers or taps a tag that has `interactive: true`. Return a `BasicRoot` widget; you can build one with `WidgetTemplate.build_basic(...)` if you are building the preview widgets server-side. See the [onRequestPreview option](https://openai.github.io/chatkit-js/api/openai/chatkit/type-aliases/entitiesoption/#onrequestpreview).
304
305

openai/chatkit-python

Branches

Tags

Clone