Extract structured content from web pages using the Tavily API
TavilyExtractorTool
allows CrewAI agents to extract structured content from web pages using the Tavily API. It can process single URLs or lists of URLs and provides options for controlling the extraction depth and including images.
TavilyExtractorTool
, you need to install the tavily-python
library:
TavilyExtractorTool
within a CrewAI agent:
TavilyExtractorTool
accepts the following arguments:
urls
(Union[List[str], str]): Required. A single URL string or a list of URL strings to extract data from.include_images
(Optional[bool]): Whether to include images in the extraction results. Defaults to False
.extract_depth
(Literal[“basic”, “advanced”]): The depth of extraction. Use "basic"
for faster, surface-level extraction or "advanced"
for more comprehensive extraction. Defaults to "basic"
.timeout
(int): The maximum time in seconds to wait for the extraction request to complete. Defaults to 60
.extract_depth
used.
Common response elements include:
include_images=True
)