Skip to content

Add talking_Xs actions and update speaking guidelines#2618

Merged
openminddev merged 1 commit into
mainfrom
add-more-action
Jun 10, 2026
Merged

Add talking_Xs actions and update speaking guidelines#2618
openminddev merged 1 commit into
mainfrom
add-more-action

Conversation

@openminddev

Copy link
Copy Markdown
Contributor

This pull request updates the behavior guidelines and action definitions for the Unitree G1 robot to introduce a new, duration-based set of talking actions for more natural and expressive speech coordination. It also removes the old generic speaking actions and clarifies gesture usage. The changes affect both configuration files and the action enumeration in the code.

Key changes:

Speaking Action Overhaul

  • Added a new set of talking actions (talking_2s, talking_4s, ..., talking_20s) that correspond to different speaking durations, and updated the guidelines to select the appropriate action based on the expected length of the spoken response. [1] [2] [3] [4] [5]
  • Removed the old generic speaking actions (speak_action, speak_action_extended) from both the configuration and the code, ensuring only the new duration-based actions are available. [1] [2] [3]

Gesture and Action Clarifications

  • Updated gesture names for consistency (e.g., rotate_hand to rotate_hands) and clarified when to use gestures versus talking actions in the guidelines. [1] [2]
  • Refined instructions to prefer the appropriate talking action while speaking, and clarified that STAND_STILL should be used only when idle or listening.

These changes make the robot's behavior more expressive and context-appropriate during conversations.

Add a Speaking Action Selection section to greeting_conversation.json5 and unitree_g1_conversation.json5 that instructs selecting a single talking_Xs action based on estimated spoken duration (talking_2s..talking_20s) and to prefer gestures when appropriate. Update the Unitree G1 system prompt to include the new talking_* actions and change ROTATE_HAND -> ROTATE_HANDS. Update ArmAction EnumValues in plugins/actions/unitree/g1/arm/zenoh.go to expose talking_2s..talking_20s and remove the old speak_action entries so runtime action enums match the new guidance.
Copilot AI review requested due to automatic review settings June 9, 2026 23:01
@openminddev openminddev requested review from a team as code owners June 9, 2026 23:01
@github-actions github-actions Bot added config Configuration files plugins go labels Jun 9, 2026
@codecov

codecov Bot commented Jun 9, 2026

Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

Copilot AI left a comment

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This pull request updates Unitree G1 conversation behavior by replacing generic speaking actions with duration-based talking_Xs actions, and adjusts guidance so arm/body motions better match spoken response length.

Changes:

  • Added duration-based talking actions (talking_2stalking_20s) to the Unitree G1 arm action enum.
  • Updated Unitree G1 conversation prompt guidelines to select a single talking action based on estimated speech duration, and clarified gesture vs. talking usage.
  • Removed legacy generic speaking actions from the Unitree G1 conversation configuration prompt.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.

File Description
plugins/actions/unitree/g1/arm/zenoh.go Extends the arm action enum with talking_Xs actions (and removes legacy speaking actions).
config/unitree_g1_conversation.json5 Updates the Unitree G1 conversation system prompt to use duration-based talking actions and clarifies gesture usage.
config/greeting_conversation.json5 Adds duration-based “Speaking Action Selection” guidance to greeting/conversation prompts.

Comment on lines +37 to +41
"talking_2s",
"talking_4s",
"talking_6s",
"talking_8s",
"talking_10s",
Comment on lines +88 to +92
Speaking Action Selection:\n\
- When generating a spoken response, estimate how long it will take to speak naturally.\n\
- Select exactly one talking action that best matches the expected speaking duration.\n\
- Use talking_2s for responses up to 2 seconds.\n\
- Use talking_4s for responses between 2 and 4 seconds.\n\
Comment on lines +231 to +235
Speaking Action Selection:\n\
- When generating a spoken response, estimate how long it will take to speak naturally.\n\
- Select exactly one talking action that best matches the expected speaking duration.\n\
- Use talking_2s for responses up to 2 seconds.\n\
- Use talking_4s for responses between 2 and 4 seconds.\n\
Comment on lines +37 to +39
"talking_2s",
"talking_4s",
"talking_6s",
@openminddev openminddev merged commit da13d1b into main Jun 10, 2026
7 checks passed
@openminddev openminddev deleted the add-more-action branch June 10, 2026 18:57
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

config Configuration files go plugins

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants