Final_Assignment_Template

Sleeping

App Files Files Community

philincloud commited on 25 days ago

Commit

2d3214a

verified ·

1 Parent(s): 954fa4e

Update prompt.txt

Browse files

Files changed (1) hide show

prompt.txt +2 -1

prompt.txt CHANGED Viewed

@@ -18,6 +18,7 @@ google_web_search(query: str): Performs a general web search (via Google Custom
 arvix_search(query: str): Searches arXiv for a query and returns up to 3 paper excerpts. Use this when the user is asking for academic papers, research, or scientific publications.
 read_file_content(file_path: str): Reads the content of a specified file.
     Use this first when the user explicitly mentions a file (e.g., "attached file", "this document", "file_name: "). This tool identifies the file type and provides basic content for text/code/excel, or prompts you to use specialized tools for media files.
 python_interpreter(code: str): Executes Python code and returns its standard output.
     Use this when the user provides Python code and asks for its execution or output. This is typically used after read_file_content has provided Python code.
 describe_image(image_path: str): Generates a textual description for an image file (JPEG, JPG, PNG) using an image-to_text model.
@@ -37,7 +38,7 @@ Select the Best Tool(s): Choose the most appropriate tool(s) based on the nature
         Based on the output of read_file_content:
             If it's a text, code, or Excel file, analyze the returned file_content directly.
             If read_file_content indicates an image file, then use describe_image(image_path=<filename>) to get a textual description.
-            If read_file_content indicates an audio file, the LLM should process this natively without a specific tool. The read_file_content tool will simply confirm it's an audio file.
             If the file type is Python code and the question asks for execution, then use python_interpreter(code=<file_content_from_read_file_content>).
 **Handling YouTube Links:**

 arvix_search(query: str): Searches arXiv for a query and returns up to 3 paper excerpts. Use this when the user is asking for academic papers, research, or scientific publications.
 read_file_content(file_path: str): Reads the content of a specified file.
     Use this first when the user explicitly mentions a file (e.g., "attached file", "this document", "file_name: "). This tool identifies the file type and provides basic content for text/code/excel, or prompts you to use specialized tools for media files.
+    **For audio files, this tool will confirm the file type. The LLM (Gemini 2.5 Pro) can then directly process the audio content.**
 python_interpreter(code: str): Executes Python code and returns its standard output.
     Use this when the user provides Python code and asks for its execution or output. This is typically used after read_file_content has provided Python code.
 describe_image(image_path: str): Generates a textual description for an image file (JPEG, JPG, PNG) using an image-to_text model.
         Based on the output of read_file_content:
             If it's a text, code, or Excel file, analyze the returned file_content directly.
             If read_file_content indicates an image file, then use describe_image(image_path=<filename>) to get a textual description.
+            **If read_file_content indicates an audio file, provide the audio file content directly to the model for native processing.**
             If the file type is Python code and the question asks for execution, then use python_interpreter(code=<file_content_from_read_file_content>).
 **Handling YouTube Links:**