CJACK
c099a6f7bf
feat: add unified response history session management across Claude, Gemini, and OpenAI API backends
Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-05-03 17:24:38 +08:00
CJACK
1286b02247
refactor: remove legacy compatibility configuration and UI components
2026-05-03 04:14:19 +08:00
CJACK
a901250de7
refactor: replace bufio.Scanner with bufio.Reader for SSE stream parsing and track emitted text to prevent redundant output blocks
2026-05-02 23:50:35 +08:00
CJACK
dc5bffdf89
refactor: centralize assistant turn semantics and stream accumulation into new assistantturn and completionruntime packages
2026-05-02 23:28:43 +08:00
王
d407ccb773
perf(streaming): optimize TTFT and reduce buffering latency
Core changes:
- stream.go: New accumulation buffer architecture with scanner goroutine
  + select loop, MinChars=16, MaxWait=10ms, first-flush-immediate
- dedupe.go: Add TrimContinuationOverlapFromBuilder to avoid string copies
- claude/stream_runtime_core.go: Integrate toolstream for incremental text
- claude/stream_runtime_finalize.go: toolstream flush support
- stream_emitter.js: Reduce DeltaCoalescer thresholds (160->16 chars, 80->20ms)
- empty_retry: Add thinking-aware empty output detection
- Fix reasoning_content leak and finish_reason=null in edge cases
- Fix tail content truncation when max_tokens exceeded
Tests: sync test expectations with upstream for thinking content
2026-05-02 20:28:30 +08:00
shern-point
d3018c281b
feat: use tokenizer-based counting in Claude token paths
Unify Claude count_tokens, legacy stream accounting, and legacy render usage with preserved prompt text so Claude stops falling back to lossy message formatting.
2026-04-30 00:46:04 +08:00
shern-point
48c4f0df9f
fix: preserve runtime tool schemas in Claude tool output
2026-04-29 01:59:24 +08:00
CJACK
abc96a37d8
refactor backend API structure
2026-04-26 06:58:20 +08:00