hermes/tests at da184439db42a6ac6816d31bb0c2fedd18d93c23 - hermes - Zopu Git: Git solution

common/hermes

Teknium da184439db execute_code: write sandbox files as UTF-8 on Windows

Second Windows-specific sandbox bug (WinError 10106 was the first):
after the env-scrub fix let the child start, it immediately failed to
import hermes_tools with:

    SyntaxError: (unicode error) 'utf-8' codec can't decode byte 0x97
                 in position 154: invalid start byte

Root cause: _execute_local wrote the generated hermes_tools.py stub and
the user's script.py via open(path, 'w') without encoding=.  On Windows
the default text-mode encoding is cp1252 (system locale), which encodes
em-dashes (used in the stub's docstrings) as 0x97.  Python then decodes
source files as UTF-8 (PEP 3120) on import, chokes on 0x97, and the
sandbox dies before any tool call.
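
The byte-level failure described above can be reproduced without Windows. The following is a minimal sketch, not code from the repository: the `source` string is a made-up stand-in for the generated stub, and the encode step simulates what a cp1252 default would do on write.

```python
# Hypothetical one-line "stub" whose docstring contains an em-dash (U+2014).
source = '"""Stub docstring \u2014 with an em-dash."""\n'

# Simulate open(path, 'w') under a cp1252 default: the em-dash becomes 0x97.
cp1252_bytes = source.encode("cp1252")

# Simulate the import step: Python decodes source files as UTF-8 by default.
error_reason = None
try:
    cp1252_bytes.decode("utf-8")
    decoded_ok = True
except UnicodeDecodeError as exc:
    decoded_ok = False
    error_reason = exc.reason

print(b"\x97" in cp1252_bytes, decoded_ok, error_reason)
```

A lone 0x97 is a UTF-8 continuation byte, which is why the decoder reports it as an invalid start byte.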

Fix: pass encoding='utf-8' to all four file opens in the code_execution
path — the two staging writes in _execute_local (hermes_tools.py +
script.py) and the two RPC file-transport reads/writes in the generated
remote stub.  JSON is ASCII-safe for most payloads, but tool results
(terminal output, web_extract content) routinely carry non-ASCII text,
so the transport files need an explicit encoding as well.
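
The shape of the fix can be sketched as follows. This is an illustration under assumptions, not the actual _execute_local code: `write_source_utf8` and `stub_source` are hypothetical names, and only the explicit `encoding='utf-8'` argument is taken from the commit message.

```python
import tempfile
from pathlib import Path


def write_source_utf8(path: Path, source: str) -> None:
    """Hypothetical helper: write text with an explicit UTF-8 encoding."""
    # Without encoding=, open() falls back to the locale encoding,
    # which may be cp1252 on Windows.
    with open(path, "w", encoding="utf-8") as fh:
        fh.write(source)


# Made-up stand-in for the generated stub; its docstring carries an em-dash.
stub_source = '"""Hypothetical stub \u2014 docstring with an em-dash."""\nVALUE = 1\n'

with tempfile.TemporaryDirectory() as tmp:
    stub_path = Path(tmp) / "hermes_tools.py"
    write_source_utf8(stub_path, stub_source)
    raw = stub_path.read_bytes()

# compile() on bytes decodes as UTF-8 by default, like the import machinery.
namespace = {}
exec(compile(raw, "hermes_tools.py", "exec"), namespace)
print(namespace["VALUE"], "\u2014" in namespace["__doc__"])
```

Passing the encoding at every call site keeps the behavior independent of the system locale, at the cost of repeating the argument.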

Tests added (4):
  - test_stub_and_script_writes_specify_utf8 — source grep guard
  - test_file_rpc_stub_uses_utf8 — generated remote stub check
  - test_stub_source_roundtrips_through_utf8 — concrete round-trip
  - test_windows_default_encoding_would_have_failed — negative control
    (skips on modern Python builds where default is already UTF-8
    compatible, but retained for platforms where the regression could
    return)
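
A source guard of the kind listed first could look like the sketch below. It is an assumption-laden illustration: the real test is described only as a "source grep guard", whereas this version swaps in an `ast` walk, and `text_opens_missing_encoding` and `sample` are hypothetical names.

```python
import ast


def text_opens_missing_encoding(source: str) -> list[int]:
    """Return line numbers of text-mode open() calls that lack encoding=."""
    missing = []
    for node in ast.walk(ast.parse(source)):
        if not (isinstance(node, ast.Call)
                and isinstance(node.func, ast.Name)
                and node.func.id == "open"):
            continue
        # Find the mode, whether given positionally or by keyword.
        mode_node = node.args[1] if len(node.args) >= 2 else None
        for kw in node.keywords:
            if kw.arg == "mode":
                mode_node = kw.value
        # A missing or non-literal mode is treated conservatively as text mode.
        mode = mode_node.value if isinstance(mode_node, ast.Constant) else "r"
        if isinstance(mode, str) and "b" in mode:
            continue  # binary mode takes no encoding argument
        if not any(kw.arg == "encoding" for kw in node.keywords):
            missing.append(node.lineno)
    return sorted(missing)


# Made-up sample: line 1 is the bug, line 2 is fixed, line 3 is binary.
sample = (
    "open(p, 'w').write(s)\n"
    "open(p, 'w', encoding='utf-8').write(s)\n"
    "open(p, 'rb').read()\n"
)
flagged = text_opens_missing_encoding(sample)
print(flagged)
```

An AST walk ignores matches inside comments and strings, which a plain text grep would also flag.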

24/25 tests pass on Windows with Python 3.11; the remaining test, the
negative control, is skipped because this Python build handles em-dashes
via a cp1252 subset.  The fix is still correct; the corruption path just
isn't always triggerable.

2026-05-08 14:27:40 -07:00

fix(acp): preserve assistant reasoning metadata in session persistence

2026-05-05 10:18:28 -07:00

feat(acp): pass image file attachments through as image_url parts

2026-05-07 09:24:32 -07:00

feat(computer-use): cua-driver backend, universal any-model schema

2026-05-08 11:07:38 -07:00

fix(goals): Ctrl+C during /goal loop auto-pauses the goal (#21888)

2026-05-08 06:53:13 -07:00

feat(cron): routing intent — deliver=all fans out to every connected channel (#21495)

2026-05-08 04:17:21 -07:00

fix(gateway): move quick-command dispatch before built-in handlers

2026-05-04 01:39:23 -07:00

environments/benchmarks

fix(security): consolidated security hardening — SSRF, timing attack, tar traversal, credential leakage (#5944)

2026-04-07 17:28:37 -07:00

fix: streaming tool call parsing, error handling, and fake HA state mutation

2026-03-14 14:27:20 +03:00

fix(teams-pipeline): add skill asset and fix async test env

2026-05-08 12:41:41 -07:00

fix(windows): prefer npm.cmd over npm.ps1, skip .py argv0 in relaunch

2026-05-08 14:27:40 -07:00

fix(resume): redirect --resume to the descendant that actually holds the messages

2026-04-24 03:04:42 -07:00

feat(honcho): explain why when honcho_profile returns an empty card

2026-04-27 12:37:33 -07:00

fix(discord): strip RTP padding before DAVE/Opus decode (#11267)

2026-04-16 16:50:15 -07:00

openviking_plugin

fix(openviking): pre-check fs/stat to route file URIs before hitting directory-only endpoints

2026-04-30 02:35:29 -07:00

fix(teams-pipeline): fill in missing delivery URL in adapter-reuse test

2026-05-08 12:00:09 -07:00

feat(providers): make all 33 providers pluggable under plugins/model-providers/

2026-05-05 13:40:01 -07:00

fix(computer-use): harden image-rejection fallback + AUTHOR_MAP

2026-05-08 11:07:38 -07:00

fix(google-workspace): restore required_credential_files in SKILL.md (#16452)

2026-05-04 12:43:14 -07:00

feat(kanban): durable multi-profile collaboration board (#17805)

2026-04-30 13:36:47 -07:00

execute_code: write sandbox files as UTF-8 on Windows

2026-05-08 14:27:40 -07:00

fix(tui): close slash parity gaps with CLI (#20339)

2026-05-05 15:42:39 -05:00

docs(skills): explain restoring bundled skills

2026-05-05 13:46:20 -07:00

__init__.py

…

conftest.py

fix(tests): avoid asyncio DeprecationWarning in event loop fixture on 3.12+

2026-05-07 07:05:05 -07:00

run_interrupt_test.py

fix: thread safety for concurrent subagent delegation (#1672)

2026-03-17 02:53:33 -07:00

test_account_usage.py

feat(account-usage): add per-provider account limits module

2026-04-21 01:56:35 -07:00

test_atomic_replace_symlinks.py

refactor: consolidate symlink-safe atomic replace into shared helper

2026-04-28 04:58:22 -07:00

test_base_url_hostname.py

security(runtime_provider): close OLLAMA_API_KEY substring-leak sweep miss (#13522)

2026-04-21 06:06:16 -07:00

test_batch_runner_checkpoint.py

test: regression coverage for checkpoint dedup and inf/nan coercion

2026-04-24 14:32:21 -07:00

test_cli_file_drop.py

fix(tui): improve macOS paste and shortcut parity

2026-04-21 08:00:00 -07:00

test_cli_manual_compress.py

test(cli): regression test for manual /compress system_message

2026-04-28 05:21:49 -07:00

test_cli_skin_integration.py

fix(ci): stabilize main test suite regressions (#17660)

2026-04-29 23:18:55 -07:00

test_ctx_halving_fix.py

fix(tests): fix 78 CI test failures and remove dead test (#9036)

2026-04-13 10:50:24 -07:00

test_empty_model_fallback.py

fix: fall back to provider's default model when model config is empty (#8303)

2026-04-12 03:53:30 -07:00

test_evidence_store.py

feat: add OSS Security Forensics skill (Skills Hub) (#1482)

2026-03-15 21:59:53 -07:00

test_get_tool_definitions_cache_isolation.py

fix(tools): isolate get_tool_definitions quiet_mode cache + dedup LCM injection (#17335)

2026-04-30 04:32:06 -07:00

test_hermes_constants.py

test(hermes_constants): cover parse_reasoning_effort()

2026-05-07 09:59:07 -07:00

test_hermes_home_profile_warning.py

fix(constants): warn once when get_hermes_home() falls back under an active profile (#18746)

2026-05-02 01:49:55 -07:00

test_hermes_logging.py

fix(logging): attach gateway log after cli init

2026-04-26 19:01:26 -07:00

test_hermes_state.py

fix(telegram): polish topic mode — CASCADE, General-topic handling, rename guard, debounce

2026-05-04 12:07:17 -07:00

test_honcho_client_config.py

feat(memory): pluggable memory provider interface with profile isolation, review fixes, and honcho CLI restoration (#4623)

2026-04-02 15:33:51 -07:00

test_install_sh_pythonpath_sanitization.py

fix: harden install.sh against inherited Python env leakage

2026-05-06 04:02:02 -07:00

test_install_sh_setup_wizard_tty_probe.py

fix(install): widen /dev/tty open-probe to sibling gates (#16746)

2026-04-28 06:45:55 -07:00

test_install_sh_termux_network_prereqs.py

fix: strengthen termux install network prerequisites

2026-05-07 13:04:08 -07:00

test_ipv4_preference.py

feat: add network.force_ipv4 config to fix IPv6 timeout issues (#8196)

2026-04-11 23:12:11 -07:00

test_lazy_session_regressions.py

fix: resolve lazy session creation regressions (#18370 fallout) (#20363)

2026-05-06 01:11:49 +05:30

test_mcp_serve.py

fix(mcp): unwrap platforms key in channels_list

2026-05-07 13:41:16 -07:00

test_mini_swe_runner.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)

2026-04-20 12:23:05 -07:00

test_minimax_model_validation.py

fix(models): validate MiniMax models against static catalog (#12611, #12460, #12399, #12547)

2026-04-19 22:44:47 -07:00

test_minimax_oauth.py

test(cli): cover minimax-oauth resolution, refresh, menu wiring

2026-04-29 09:53:42 -07:00

test_minisweagent_path.py

chore: remove all remaining mini-swe-agent references

2026-03-24 08:19:23 -07:00

test_model_picker_scroll.py

fix: CLI/UX batch — ChatConsole errors, curses scroll, skin-aware banner, git state banner (#5974)

2026-04-07 17:59:42 -07:00

test_model_tools_async_bridge.py

fix(model_tools): cancel coroutine on timeout so worker thread exits + log full traceback

2026-04-29 05:00:40 -07:00

test_model_tools.py

fix(plugins): stop firing pre_tool_call hook twice per tool execution (#17611)

2026-04-29 12:43:39 -07:00

test_ollama_num_ctx.py

fix: provider/model resolution — salvage 4 PRs + MiniMax aux URL fix (#5983)

2026-04-07 22:23:28 -07:00

test_packaging_metadata.py

chore: prepare Hermes for Homebrew packaging (#4099)

2026-03-30 17:34:43 -07:00

test_plugin_skills.py

fix(skills): support category-qualified local skill names

2026-05-05 10:15:31 -07:00

test_process_loop_event_loop_warning.py

fix(cli): replace get_event_loop() with get_running_loop() to silence RuntimeWarning in process_loop thread (#19285)

2026-05-07 06:35:54 -07:00

test_project_metadata.py

build(deps): add qrcode to dingtalk + feishu extras (parity with messaging) (#11627)

2026-04-17 13:31:53 -07:00

test_retry_utils.py

feat(agent): add jittered retry backoff

2026-04-08 00:41:36 -07:00

test_sql_injection.py

fix(security): eliminate SQL string formatting in execute() calls

2026-03-19 15:16:35 +01:00

test_subprocess_home_isolation.py

fix: per-profile subprocess HOME isolation (#4426) (#7357)

2026-04-10 13:37:45 -07:00

test_termux_all_extra_compat.py

fix: add termux-all install profile and safe fallbacks

2026-05-07 13:04:08 -07:00

test_timezone.py

test: speed up slow tests (backoff + subprocess + IMDS network) (#11797)

2026-04-17 14:21:22 -07:00

test_toolset_distributions.py

…

test_toolsets.py

fix: merge plugin tools into builtin toolsets

2026-05-05 10:14:17 -07:00

test_trajectory_compressor_async.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)

2026-04-20 12:23:05 -07:00

test_trajectory_compressor.py

fix(kimi): omit temperature entirely for Kimi/Moonshot models (#13157)

2026-04-20 12:23:05 -07:00

test_transform_llm_output_hook.py

test+docs: cover transform_llm_output hook + release author map

2026-05-07 05:46:05 -07:00

test_transform_tool_result_hook.py

test: stop testing mutable data — convert change-detectors to invariants (#13363)

2026-04-20 23:20:33 -07:00

test_tui_gateway_server.py

Merge pull request #20942 from NousResearch/austin/fix/personality

2026-05-07 18:54:29 -04:00

test_utils_truthy_values.py

Gate tool-gateway behind an env var, so it's not in users' faces until we're ready. Even if users enable it, it'll be blocked server-side for now, until we unlock for non-admin users on tool-gateway.

2026-03-30 13:28:10 +09:00

test_yuanbao_integration.py

yuanbao platform (#16298)

2026-04-26 18:50:49 -07:00

test_yuanbao_markdown.py

yuanbao platform (#16298)

2026-04-26 18:50:49 -07:00

test_yuanbao_pipeline.py

yuanbao platform (#16298)

2026-04-26 18:50:49 -07:00

test_yuanbao_proto.py

yuanbao platform (#16298)

2026-04-26 18:50:49 -07:00