Skip to content

fix: replace decommissioned kimi models with kimi-k2p5#447

Merged
SunnySoldier357 merged 5 commits intomainfrom
sandeep/fix-kimi-model
Apr 24, 2026
Merged

fix: replace decommissioned kimi models with kimi-k2p5#447
SunnySoldier357 merged 5 commits intomainfrom
sandeep/fix-kimi-model

Conversation

@SunnySoldier357
Copy link
Copy Markdown
Collaborator

@SunnySoldier357 SunnySoldier357 commented Apr 24, 2026

Summary

  • Replace decommissioned kimi-k2-instruct-0905 and kimi-k2-thinking with kimi-k2p5 across all tests and quickstart configs — the old models are no longer deployed on serverless and return 404s
  • Add "reasoning_effort": "none" to non-quality-sensitive test completion params to match the old instruct model's non-reasoning behavior
  • Remove test_basic_coding and test_hallucination pytest tests and their datasets — these tested model quality, not SDK correctness, making them fragile CI gates

Test plan

  • All CI shards should pass (Shard 1 was failing with score thresholds, Shard 2 was failing with 404s on kimi-k2-thinking)

Made with Cursor

@SunnySoldier357 SunnySoldier357 self-assigned this Apr 24, 2026
Copy link
Copy Markdown

@cursor cursor Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Fix All in Cursor

❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, have a team admin enable autofix in the Cursor dashboard.

Reviewed by Cursor Bugbot for commit 14563f6. Configure here.

Comment thread eval_protocol/benchmarks/test_frozen_lake.py
Copy link
Copy Markdown

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 14563f6e37

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread tests/pytest/test_openenv_browsergym_basic.py
@SunnySoldier357 SunnySoldier357 changed the title fix: replace decommissioned kimi-k2-instruct-0905 with kimi-k2p5 fix: replace decommissioned kimi models with kimi-k2p5 Apr 24, 2026
@SunnySoldier357 SunnySoldier357 merged commit b3b02c8 into main Apr 24, 2026
17 checks passed
@SunnySoldier357 SunnySoldier357 deleted the sandeep/fix-kimi-model branch April 24, 2026 21:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants