philincloud committed on
Commit 2d3214a · verified · 1 Parent(s): 954fa4e

Update prompt.txt

  1. prompt.txt +2 -1
prompt.txt CHANGED
@@ -18,6 +18,7 @@ google_web_search(query: str): Performs a general web search (via Google Custom
  arvix_search(query: str): Searches arXiv for a query and returns up to 3 paper excerpts. Use this when the user is asking for academic papers, research, or scientific publications.
  read_file_content(file_path: str): Reads the content of a specified file.
  Use this first when the user explicitly mentions a file (e.g., "attached file", "this document", "file_name: "). This tool identifies the file type and provides basic content for text/code/excel, or prompts you to use specialized tools for media files.
+ **For audio files, this tool will confirm the file type. The LLM (Gemini 2.5 Pro) can then directly process the audio content.**
  python_interpreter(code: str): Executes Python code and returns its standard output.
  Use this when the user provides Python code and asks for its execution or output. This is typically used after read_file_content has provided Python code.
  describe_image(image_path: str): Generates a textual description for an image file (JPEG, JPG, PNG) using an image-to_text model.
@@ -37,7 +38,7 @@ Select the Best Tool(s): Choose the most appropriate tool(s) based on the nature
  Based on the output of read_file_content:
  If it's a text, code, or Excel file, analyze the returned file_content directly.
  If read_file_content indicates an image file, then use describe_image(image_path=<filename>) to get a textual description.
- If read_file_content indicates an audio file, the LLM should process this natively without a specific tool. The read_file_content tool will simply confirm it's an audio file.
+ **If read_file_content indicates an audio file, provide the audio file content directly to the model for native processing.**
  If the file type is Python code and the question asks for execution, then use python_interpreter(code=<file_content_from_read_file_content>).
 
  **Handling YouTube Links:**
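
Note on the file-type routing touched by this commit: the follow-up step after read_file_content could be sketched roughly as below. This is only an illustration under stated assumptions. The function name `next_step`, the returned labels, and the audio extensions are hypothetical. The prompt itself leaves file-type detection to read_file_content and names only JPEG, JPG, and PNG for images.

```python
from pathlib import Path

# Image types named in the prompt for describe_image.
IMAGE_EXTS = {".jpeg", ".jpg", ".png"}
# Assumption: the prompt does not list audio extensions; these are placeholders.
AUDIO_EXTS = {".mp3", ".wav"}


def next_step(file_path: str, wants_execution: bool = False) -> str:
    """Return a label for the step that follows read_file_content (hypothetical helper)."""
    ext = Path(file_path).suffix.lower()
    if ext in IMAGE_EXTS:
        # Images go to the describe_image tool.
        return "describe_image"
    if ext in AUDIO_EXTS:
        # After this commit: no tool call, the audio is handed to the model directly.
        return "native_audio"
    if ext == ".py" and wants_execution:
        # Python code is executed only when the question asks for it.
        return "python_interpreter"
    # Text, code, and Excel content is analyzed directly from file_content.
    return "analyze_file_content"
```

Under these assumptions, `next_step("interview.mp3")` yields the native-audio label, which corresponds to the behavior the two added lines describe.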