AI

AI Detect App Language​

Restrictions: "none"

No docs available

Inputs​

| Name | Type | Description | Required | Dynamic |
| --- | --- | --- | --- | --- |
| model | enum | The AI model to use for vision processing | No | No |

Outputs​

| Name | Type | Description |
| --- | --- | --- |
| default | string | The language of the app as a two-letter ISO 639-1 code |
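A downstream step may want to sanity-check the `default` output before using it. The following is a minimal sketch in Python, not part of the product: the function names are hypothetical, and the check only verifies the two-letter shape of the code, not that the code is actually registered in ISO 639-1.

```python
import re

# Two lowercase ASCII letters: the shape of an ISO 639-1 code.
# This does not check membership in the ISO 639-1 registry.
_ISO_639_1_SHAPE = re.compile(r"[a-z]{2}")


def looks_like_iso639_1(code: str) -> bool:
    """Return True if `code` has the two-letter shape of an ISO 639-1 code."""
    return _ISO_639_1_SHAPE.fullmatch(code) is not None


def normalize_language(code: str) -> str:
    """Trim and lowercase a detected code, rejecting anything with the wrong shape."""
    cleaned = code.strip().lower()
    if not looks_like_iso639_1(cleaned):
        raise ValueError(f"not a two-letter language code: {code!r}")
    return cleaned
```

For example, `normalize_language(" EN ")` returns `"en"`, while `normalize_language("english")` raises `ValueError`.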

Ask AI (Yes/No)​

Restrictions: "none"

No docs available

Inputs​

| Name | Type | Description | Required | Dynamic |
| --- | --- | --- | --- | --- |
| question | text | A yes/no question to ask the AI about the screen | Yes | No |
| region | region | Specify an optional region to crop the screen to before passing it to the AI | No | No |
| model | enum | The AI model to use for vision processing | No | No |

Outputs​

| Name | Type | Description |
| --- | --- | --- |
| default | boolean | true if the AI answers yes to the question, false otherwise |
| viewpoint | image | The viewpoint of the AI |

Prompt AI​

Restrictions: "none"

No docs available

Inputs​

| Name | Type | Description | Required | Dynamic |
| --- | --- | --- | --- | --- |
| prompt | text | Plain English prompt to the AI | Yes | No |
| region | region | Specify an optional region to crop the screen to before passing it to the AI | No | No |
| model | enum | The AI model to use for vision processing | No | No |

Outputs​

| Name | Type | Description |
| --- | --- | --- |
| default | string | Output text generated by the AI |
| viewpoint | image | The viewpoint of the AI |

AI Navigate (Deprecated)​

Restrictions: "none"

Navigates to a specific UI element using AI-generated instructions.

Inputs​

| Name | Type | Description | Required | Dynamic |
| --- | --- | --- | --- | --- |
| target | string | Name of the UI element to navigate to | Yes | No |
| region | region | Specify an optional region to crop the screen to before passing it to the AI | No | No |
| additionalInstructions | text | Any additional context or instructions for the LLM. This could be hints about the UI, like: 'The selected element is highlighted by a yellow color'. | No | No |
| waitAfterInput | number | Wait after executing the navigation sequence before the next step. | No | No |
| confirm | boolean | Select the item after completing navigation | Yes | No |
| captureBeforeConfirm | boolean | Capture a screenshot before pressing the OK button | Yes | No |
| promptType | enum | The method to use for the navigation | Yes | No |
| model | enum | The AI model to use for vision processing | No | No |

Outputs​

| Name | Type | Description |
| --- | --- | --- |
| default | string | Output text generated by the AI |
| llmResponse | string | Response from the LLM, for debugging |
| viewpoint | image | The viewpoint of the AI |
| screenBeforeConfirm | image | The screen shown before pressing the OK button |

AI Navigation Agent (Deprecated)​

Restrictions: "none"

Navigates to a specific UI element using AI-generated instructions with recursive prompting.

Inputs​

| Name | Type | Description | Required | Dynamic |
| --- | --- | --- | --- | --- |
| target | string | Name of the UI element to navigate to | Yes | No |
| region | region | Specify an optional region to crop the screen to before passing it to the AI | No | No |
| additionalInstructions | text | Any additional context or instructions for the LLM. This could be hints about the UI, like: 'The selected element is highlighted by a yellow color'. | No | No |
| commandsPerIteration | number | Number of commands to execute in each iteration | Yes | No |
| maxIterations | number | Maximum number of iterations before stopping | Yes | No |
| waitAfterInput | number | Wait after executing each command in the navigation sequence. | No | No |
| model | enum | The AI model to use for vision processing | No | No |

Outputs​

| Name | Type | Description |
| --- | --- | --- |
| default | string | Output text generated by the AI |
| llmResponse | string | Response from the LLM, for debugging |
| viewpoint | image | The final viewpoint of the AI |
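The interaction between `commandsPerIteration`, `maxIterations`, and `waitAfterInput` can be pictured as a bounded plan-and-execute loop. The Python sketch below is only an illustration of that idea and is not the product's implementation: the `plan`, `send`, and `reached` callbacks are hypothetical stand-ins for the model prompt, the remote-control command, and the target check, and treating the wait as seconds is an assumption.

```python
import time
from typing import Callable, List


def navigate(
    plan: Callable[[], List[str]],   # hypothetical: asks the model for the next commands
    send: Callable[[str], None],     # hypothetical: sends one remote-control command
    reached: Callable[[], bool],     # hypothetical: checks whether the target is reached
    commands_per_iteration: int,
    max_iterations: int,
    wait_after_input: float = 0.0,   # assumed to be seconds in this sketch
) -> List[str]:
    """Run plan/execute rounds until the target is reached or the budget runs out."""
    executed: List[str] = []
    for _ in range(max_iterations):
        if reached():
            break
        # Execute at most commands_per_iteration commands, then prompt again.
        for command in plan()[:commands_per_iteration]:
            send(command)
            executed.append(command)
            if wait_after_input > 0:
                time.sleep(wait_after_input)
    return executed
```

Under this reading, a small `commandsPerIteration` makes the loop re-check the screen more often, while `maxIterations` caps the total number of prompts.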

AI Agent​

Restrictions: "none"

AI agent that can navigate and interact with TV devices using vision and control capabilities.

Inputs​

| Name | Type | Description | Required | Dynamic |
| --- | --- | --- | --- | --- |
| goal | text | Describe what you want the AI agent to accomplish. Examples: "Navigate to Settings" (just move focus, don't press); "Open Settings" (navigate and press SELECT); "Select the Netflix app"; "Find the search icon in the top right" | Yes | No |
| screenshot | boolean | Allow the AI agent to capture screenshots for visual analysis | No | No |
| remoteControlActions | remote_keys | Select which remote control commands the AI agent can use | No | No |
| sleep | boolean | Allow the AI agent to wait for a number of seconds (e.g. after launching an app or pressing a button) | No | No |
| maxAttempts | number | Maximum number of navigation attempts before giving up | No | No |
| screenshotDelayMs | number | Milliseconds to wait before each capture. Leave empty for no delay. | No | No |
| model | enum | The AI model to use for vision processing and decision making | No | No |

Outputs​

| Name | Type | Description |
| --- | --- | --- |
| success | boolean | Whether the navigation successfully completed the goal |
| actions | string | Sequence of actions taken (e.g., 'up,right,down,select') |
| attempts | number | Number of navigation attempts made |
| finalScreen | image | Screenshot after navigation completion |
| reasoning | string | AI's explanation of what it did and why |
| default | null | |
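Because `actions` is a single string, a later step that needs the individual key presses has to split it. A minimal Python sketch follows, assuming the comma-separated form shown in the example above; the helper name is hypothetical and not part of the product.

```python
from typing import List


def parse_actions(actions: str) -> List[str]:
    """Split a comma-separated action string into a list, ignoring blank entries."""
    return [part.strip() for part in actions.split(",") if part.strip()]


print(parse_actions("up,right,down,select"))
# → ['up', 'right', 'down', 'select']
```

An empty string yields an empty list, which may be convenient when the agent reports no actions.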

AI Detect on Screen​

Restrictions: "none"

No docs available

Inputs​

| Name | Type | Description | Required | Dynamic |
| --- | --- | --- | --- | --- |
| element_type | text | The type of element to look for on the screen, e.g. 'button' or 'text' | Yes | No |
| text | text | The text of the element to look for on the screen, e.g. 'Login' or 'Submit' | No | No |
| region | region | Specify an optional region to crop the screen to before passing it to the AI | No | No |
| model | enum | The AI model to use for vision processing | No | No |
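The Required column above can be turned into a simple pre-flight check. The sketch below is in Python and assumes, purely for illustration, that the inputs are held in a plain dictionary keyed by input name; that representation and the `check_inputs` helper are hypothetical and not part of the product.

```python
from typing import Any, Dict, List

# Input names taken from the table above.
REQUIRED = {"element_type"}
OPTIONAL = {"text", "region", "model"}


def check_inputs(inputs: Dict[str, Any]) -> List[str]:
    """Return a list of problems: missing required inputs and unknown names."""
    provided = set(inputs)
    problems = [f"missing required input: {name}" for name in sorted(REQUIRED - provided)]
    problems += [f"unknown input: {name}" for name in sorted(provided - REQUIRED - OPTIONAL)]
    return problems


print(check_inputs({"element_type": "button", "text": "Login"}))
# → []
```

Returning a list instead of raising lets a caller report every problem at once.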

Outputs​

| Name | Type | Description |
| --- | --- | --- |
| default | boolean | Whether the element was detected or not |
| viewpoint | image | The viewpoint of the AI |