ds2api

mirror of https://github.com/CJackHwang/ds2api.git synced 2026-05-04 00:15:28 +08:00

Author	SHA1	Message	Date
CJACK	e2756f800d	feat: introduce JSON UTF-8 validation middleware and prepend output integrity guard system prompt to messages	2026-05-02 02:22:34 +08:00
CJACK	55abf64717	feat: add model type support for file uploads with automatic resolution and header propagation	2026-05-02 00:55:17 +08:00
CJACK	0bca6e2cee	feat: implement context cancellation handling for chat and response stream runtimes to ensure clean termination without retries	2026-05-01 23:20:46 +08:00
CJACK.	934b40e572	Merge pull request #392 from wyv202011y/fix/timeout-and-context-cancel fix: increase stream timeout constants for large-context models; guar…	2026-05-01 23:17:31 +08:00
CJACK	dd5a0c5213	refactor: update and standardize current input file continuation prompt instructions	2026-05-01 22:27:59 +08:00
CJACK	43402e7a26	refactor: rename history file constant from HISTORY.txt to DS2API_HISTORY.txt across codebase and tests	2026-05-01 22:05:45 +08:00
CJACK	df1cfac9bc	refactor: replace history transcript format with numbered sections and rename upload file to HISTORY.txt	2026-05-01 21:15:17 +08:00
王	706e68de23	fix: increase stream timeout constants for large-context models; guard against context-cancelled double-recording - Increase StreamIdleTimeout from 90s to 300s and MaxKeepaliveCount from 10 to 40 to prevent premature stream termination with DeepSeek V4 Pro (~50K token contexts) - Add r.Context().Err() check after ConsumeSSE in empty_retry_runtime (chat + responses) to prevent historySession.error() from overwriting historySession.stopped() when the request context is cancelled References: - MaxKeepaliveCount=10 creates a 50s no-content timeout that kills the stream before DeepSeek V4 Pro can produce its first token with large contexts - Hermes Agent reports 'No response from provider for 180s' because the underlying SSE connection was already terminated by ds2api at 50s - Context cancellation path: OnContextDone -> stopped(), then finalize() with empty output -> retry -> error() overwrites stopped()	2026-05-01 21:11:36 +08:00
CJACK	2671298439	fix: coalesce small stream deltas to prevent character swallowing; add read-tool cache guard Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-05-01 13:53:27 +08:00
CJACK	92e321fe2c	修复吞字问题	2026-05-01 01:31:48 +08:00
CJACK.	95b7665643	Merge branch 'dev' into codex/run-all-tests-and-fix-failures	2026-04-30 02:39:18 +08:00
CJACK.	966f21211d	Fix nil-session guard in chat history test	2026-04-30 02:31:06 +08:00
NgoQuocViet2001	7dc3af40b2	feat(openai): add root route aliases	2026-04-30 01:24:53 +07:00
CJACK.	2f6b5ffda0	Fix current-input token text test expectation	2026-04-30 02:22:17 +08:00
CJACK.	7c3ff6ee7e	Merge pull request #374 from shern-point/feat/full-context-file-token-accounting Feat/full context file token accounting	2026-04-30 02:12:55 +08:00
CJACK.	63e62fd1b0	Merge pull request #372 from shern-point/feat/accurate-context-token-length Feat/accurate context token length	2026-04-30 02:11:32 +08:00
shern-point	6a778e0d35	feat: include inline-uploaded file tokens in context token accounting Track byte sizes of inline-uploaded files during PreprocessInlineFileInputs and convert them to conservative token estimates (bytes/3). RefFileTokens is threaded through StandardRequest into all OpenAI chat/responses usage builders so returned prompt_tokens/input_tokens reflect the full upstream context cost including attached files.	2026-04-30 01:42:51 +08:00
NgoQuocViet2001	9035c350a7	fix(openai): return 400 for inline file limit	2026-04-30 00:35:59 +07:00
shern-point	ba80052a26	fix: count uploaded file content in context token accounting PromptTokenText now reflects the actual downstream context cost: the uploaded IGNORE.txt file content plus the neutral live prompt, instead of only the pre-split prompt text.	2026-04-30 01:12:35 +08:00
shern-point	78fdd63470	feat: add full-context token regression coverage and docs Lock in the current_input_file regression with API-level tests and document that returned context token counts now track full prompt semantics with conservative sizing.	2026-04-30 00:46:06 +08:00
shern-point	4b4f097006	feat: use model-aware prompt counting in Gemini paths Preserve Gemini prompt token text during normalization and remove the hardcoded DeepSeek model from native Gemini usage helpers.	2026-04-30 00:46:05 +08:00
shern-point	d3018c281b	feat: use tokenizer-based counting in Claude token paths Unify Claude count_tokens, legacy stream accounting, and legacy render usage with preserved prompt text so Claude stops falling back to lossy message formatting.	2026-04-30 00:46:04 +08:00
shern-point	415a2359ad	feat: route OpenAI responses usage through preserved prompt text Use the stored full-context prompt text for responses accounting so neutral placeholder prompts do not underreport returned input token counts.	2026-04-30 00:45:31 +08:00
shern-point	f702d45a24	feat: route OpenAI chat usage through preserved prompt text Use the stored full-context prompt text for chat non-stream, stream, and retry accounting so current_input_file no longer shrinks returned prompt token counts.	2026-04-30 00:45:30 +08:00
shern-point	b96f736bd2	feat: preserve full prompt text across current_input_file rewrites Keep token accounting tied to the original prompt even after the live prompt is replaced with a neutral placeholder and hidden context file.	2026-04-30 00:45:01 +08:00
CJACK.	33f6fef015	Fix tool-call fallback on sanitized empty text and remove history wrapper tags	2026-04-29 23:04:45 +08:00
MiY	241334c658	Fix stream compatibility and vision model exposure	2026-04-29 20:23:13 +08:00
CJACK.	22160de2c4	Merge pull request #359 from NgoQuocViet2001/ai/ds2api-small-fix fix(openai): keep citation indexes one-based with zero-based references	2026-04-29 18:27:15 +08:00
NgoQuocViet2001	0cbc2c875d	fix(openai): keep citation indexes one-based	2026-04-29 15:43:09 +07:00
CJACK.	2c8409dcbb	fix docker defaults to writable /data config path and align docs	2026-04-29 13:46:22 +08:00
CJACK.	929d9a8ef7	Merge pull request #352 from shern-point/fix/tool-string-schema-protection Fix/tool type schema protection	2026-04-29 07:51:21 +08:00
shern-point	6e21714e23	test: cover Claude schema-aware tool normalization	2026-04-29 01:59:42 +08:00
shern-point	48c4f0df9f	fix: preserve runtime tool schemas in Claude tool output	2026-04-29 01:59:24 +08:00
ouqiting	28d2b0410f	feat: parse split context files in list view	2026-04-29 01:15:29 +08:00
CJACK.	685b5011e4	Merge pull request #343 from livesRan/fix-429Resend-pr 支持 reference 引用标签转链接，并兼容 0 基序号映射	2026-04-28 21:47:15 +08:00
songguoliang	15e9eb3639	支持 reference 引用标签转链接，并兼容 0 基序号映射	2026-04-28 16:42:37 +08:00
shern-point	72c8e7e9f9	test: cover responses string-protected tool arguments Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-28 13:46:43 +08:00
shern-point	b9c8e90d98	refactor: thread tool schemas through responses tool outputs Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-28 13:46:06 +08:00
shern-point	36fcba1280	test: cover chat string-protected tool arguments Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-28 13:45:35 +08:00
shern-point	801b5abce3	refactor: thread tool schemas through chat tool outputs Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>	2026-04-28 13:38:57 +08:00
CJACK	9f7b671e5e	Revert "refactor: consolidate current_input_file prompt into BuildOpenAICurrentInputContextPrompt" This reverts commit `d40888496e`.	2026-04-28 00:31:12 +08:00
CJACK	d40888496e	refactor: consolidate current_input_file prompt into BuildOpenAICurrentInputContextPrompt Extract the compacted-context prompt string into a single function in promptcompat and add a [context note] block to the injected file wrapper so the model knows the attached history is compressed context, not new instructions. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-28 00:01:14 +08:00
CJACK	28bb85ad63	refactor: replace history_split with current_input_file configuration Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-27 23:36:56 +08:00
CJACK	b82bc1311a	fix: use parent_message_id and fresh PoW headers for empty-output retry and continue Previously retry/continue requests reused the initial PoW header and lacked parent_message_id, causing them to land as disconnected root messages in the DeepSeek session instead of proper follow-up turns. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-27 21:31:51 +08:00
CJACK	0378d8c0a9	feat: add empty-output retry and Vercel auto-continue support - Auto-retry Chat/Responses streams once when upstream output is empty but not content-filtered, reusing session/token/PoW and appending a regeneration suffix to the prompt - Wire DeepSeek continue API into Vercel streams for multi-round thinking output exhaustion - Defer empty-output errors in stream finalizers to enable synthetic retry; only surface failure when the retry budget is exhausted - Track content_filter stops to avoid retry on filtered outputs - Add comprehensive tests for stream/non-stream retry, Responses retry, and content_filter no-retry - Update prompt-compatibility.md documentation Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-27 18:00:52 +08:00
CJACK	70467054c3	fix: preserve partial-update fields for current_input_file and thinking_injection, expand DSML space-separator aliases - Guard current_input_file.enabled / thinking_injection.{enabled,prompt} with hasNestedSettingsKey so partial updates don't overwrite omitted fields - Expand DSML alias support to tolerate space-separated tags (e.g. <\|dsml invoke>) alongside pipe-separated forms - Sync Go sieve, Node sieve, toolcall parser, and tests for all new DSML variants - Update API.md and toolcall-semantics.md with expanded alias coverage Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-27 15:06:44 +08:00
CJACK	6959aa2982	feat: add ETag cache optimization, code-split WebUI, and refactor XML tool scanner - Chat history: early 304 via Revision()/DetailRevision() to avoid full snapshot reads - WebUI: lazy-load tab containers with Suspense fallback - Toolstream: split tool_sieve_xml.go into tags.go and scan.go - CI: trigger on main branch, guard cross-build to dev/main pushes only - Docs: add DEVELOPER.md developer quick reference Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-27 14:37:23 +08:00
CJACK	1602c3a43c	lint	2026-04-27 13:48:55 +08:00
CJACK	90ce595325	chore: update project files	2026-04-27 02:09:11 +08:00
CJACK	40d5e3ebb5	测试DSML	2026-04-27 00:21:26 +08:00

1 2

57 Commits