Gemini CLI
Product overview
Name of Agent: Gemini CLI
Who is using it?: End users and enterprise customers, for coding and prototyping
Website: (https://geminicli.com/, archived)
Category: Chat
Company & accountability
Developer: Google
For profit company?: Yes
Parent company?: Alphabet Inc
Technical capabilities & system architecture
Documentation: (https://geminicli.com/docs/, archived)
Observation space:
File System Tools: For interacting with files and directories (reading, writing, listing, searching, etc.).
Shell Tool (run_shell_command): For executing shell commands.
Web Fetch Tool (web_fetch): For retrieving content from URLs.
Web Search Tool (google_web_search): For searching the web.
Memory Tool (save_memory): For saving and recalling information across sessions.
Todo Tool (write_todos): For managing subtasks of complex requests.
MCP servers: MCP servers act as a bridge between the Gemini model and your local environment or other services like APIs. (source, archived)
Action space:
File System Tools: For interacting with files and directories (reading, writing, listing, searching, etc.).
Shell Tool (run_shell_command): For executing shell commands.
MCP servers: MCP servers act as a bridge between the Gemini model and your local environment or other services like APIs. (source, archived)
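The two lists above overlap: some tools appear under both the observation space and the action space, while others appear only under observation. The minimal Python sketch below makes that split explicit. It only restates the tool labels from this card; the variable and function names are hypothetical and are not part of Gemini CLI.

```python
# Tool labels as listed in this card. Names below are illustrative only
# and do not correspond to any Gemini CLI API.
OBSERVATION_SPACE = [
    "File System Tools",
    "Shell Tool",
    "Web Fetch Tool",
    "Web Search Tool",
    "Memory Tool",
    "Todo Tool",
    "MCP servers",
]

ACTION_SPACE = [
    "File System Tools",
    "Shell Tool",
    "MCP servers",
]


def observe_only(observation, action):
    """Return tools listed for observation but not for action, in card order."""
    return [tool for tool in observation if tool not in action]


def observe_and_act(observation, action):
    """Return tools listed in both spaces, in card order."""
    return [tool for tool in observation if tool in action]


if __name__ == "__main__":
    print("Observe only:", observe_only(OBSERVATION_SPACE, ACTION_SPACE))
    print("Observe and act:", observe_and_act(OBSERVATION_SPACE, ACTION_SPACE))
```

Read this way, the card treats the file system tools, the shell tool, and MCP servers as the channels through which the agent can change its environment, and the remaining tools as sources of information only.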
User interface and interaction design: Chatbot in CLI
User roles: Operator (issues queries, which the agent responds to); Executor (user may take actions or make decisions based on outputs); Examiner (user can use thumbs up/down buttons to give feedback)
Component accessibility: The agentic harness is open source (link); the models are closed source
Autonomy & control
Autonomy level and planning depth: L1-L3: Tasks that users assign to the agent are often narrow in scope. More complex tasks need multi-turn conversations in which the user is in charge of planning, and the agent always returns to the user and awaits further instructions. The agent can also create a plan and execute it across multiple steps.
Execution monitoring, traces, and transparency: Visible (albeit summarized) chain of thought (CoT), with a description of the task currently being executed
Emergency stop and shut down mechanisms and user control: User can pause/stop the agent at any time
Usage monitoring and statistics and patterns: User can see how much of the context has been used
Ecosystem interaction
Identifies to humans?: None found
Identifies technically?: None found, but presumably the same as Gemini
Safety, evaluation & impact
Third-party testing, audits, and red-teaming: None found
Any known incidents?: None found