Telemetry

By default, we collect no data that would be considered personal information under GDPR and other privacy regulations. We do collect Tool’s names and Agent’s roles, so be advised not to include any personal information in the tool’s names or the Agent’s roles. Because no personal information is collected, it’s not necessary to worry about data residency. When share_crew is enabled, additional data is collected which may contain personal information if included by the user. Users should exercise caution when enabling this feature to ensure compliance with privacy regulations.

CrewAI utilizes anonymous telemetry to gather usage statistics with the primary goal of enhancing the library. Our focus is on improving and developing the features, integrations, and tools most utilized by our users.

It’s pivotal to understand that by default, NO personal data is collected concerning prompts, task descriptions, agents’ backstories or goals, usage of tools, API calls, responses, any data processed by the agents, or secrets and environment variables. When the share_crew feature is enabled, detailed data including task descriptions, agents’ backstories or goals, and other specific attributes are collected to provide deeper insights. This expanded data collection may include personal information if users have incorporated it into their crews or tasks. Users should carefully consider the content of their crews and tasks before enabling share_crew. Users can disable telemetry by setting the environment variable OTEL_SDK_DISABLED to true.

Data Explanation:

DefaultedDataReason and Specifics
YesCrewAI and Python VersionTracks software versions. Example: CrewAI v1.2.3, Python 3.8.10. No personal data.
YesCrew MetadataIncludes: randomly generated key and ID, process type (e.g., ‘sequential’, ‘parallel’), boolean flag for memory usage (true/false), count of tasks, count of agents. All non-personal.
YesAgent DataIncludes: randomly generated key and ID, role name (should not include personal info), boolean settings (verbose, delegation enabled, code execution allowed), max iterations, max RPM, max retry limit, LLM info (see LLM Attributes), list of tool names (should not include personal info). No personal data.
YesTask MetadataIncludes: randomly generated key and ID, boolean execution settings (async_execution, human_input), associated agent’s role and key, list of tool names. All non-personal.
YesTool Usage StatisticsIncludes: tool name (should not include personal info), number of usage attempts (integer), LLM attributes used. No personal data.
YesTest Execution DataIncludes: crew’s randomly generated key and ID, number of iterations, model name used, quality score (float), execution time (in seconds). All non-personal.
YesTask Lifecycle DataIncludes: creation and execution start/end times, crew and task identifiers. Stored as spans with timestamps. No personal data.
YesLLM AttributesIncludes: name, model_name, model, top_k, temperature, and class name of the LLM. All technical, non-personal data.
YesCrew Deployment attempt using crewAI CLIIncludes: The fact a deploy is being made and crew id, and if it’s trying to pull logs, no other data.
NoAgent’s Expanded DataIncludes: goal description, backstory text, i18n prompt file identifier. Users should ensure no personal info is included in text fields.
NoDetailed Task InformationIncludes: task description, expected output description, context references. Users should ensure no personal info is included in these fields.
NoEnvironment InformationIncludes: platform, release, system, version, and CPU count. Example: ‘Windows 10’, ‘x86_64’. No personal data.
NoCrew and Task Inputs and OutputsIncludes: input parameters and output results as non-identifiable data. Users should ensure no personal info is included.
NoComprehensive Crew Execution DataIncludes: detailed logs of crew operations, all agents and tasks data, final output. All non-personal and technical in nature.

“No” in the “Defaulted” column indicates that this data is only collected when share_crew is set to true.

Opt-In Further Telemetry Sharing

Users can choose to share their complete telemetry data by enabling the share_crew attribute to True in their crew configurations. Enabling share_crew results in the collection of detailed crew and task execution data, including goal, backstory, context, and output of tasks. This enables a deeper insight into usage patterns.

If you enable share_crew, the collected data may include personal information if it has been incorporated into crew configurations, task descriptions, or outputs. Users should carefully review their data and ensure compliance with GDPR and other applicable privacy regulations before enabling this feature.