Vision AI
Fully managed production environment to create your own computer vision applications.
Benefits
Accelerate Time-to-Value: Effortlessly construct, deploy, and oversee computer vision applications tailored to your distinct business requirements. Leverage pre-trained APIs, AutoML, and custom models to streamline development and reduce complexity.
Versatility for Diverse Needs: Address a range of applications and skill levels. Choose between plug-and-play analytics through APIs, custom machine learning models, or comprehensive end-to-end development environments, all available within our vision portfolio.
Assured quality from the leader in vision of: Benefit from investments in vision across our portfolio. Vision offerings have received the highest ratings from several analyst firms.
Detect Text In Images (text_detection)
text_detection)
Text Detection (Image)
POST
https://api.autogon.ai/api/v1/services/vision-ai/
TEXT_DETECTION
detects and extracts text from any image. For example, a photograph might contain a street sign or traffic sign. The JSON includes the entire extracted string, as well as individual words, and their bounding boxes.
Headers
Content-Type*
String
multipart/form-data
Request Body
operation*
String
operation to be performed: text_detection
image*
File
image file to be processed
Detect Text In Documents (document_text_detection
)
document_text_detection
) Text Detection (Documents)
POST
https://api.autogon.ai/api/v1/services/vision-ai/
DOCUMENT_TEXT_DETECTION
also extracts text from an image, but the response is optimized for dense text and documents. The JSON includes page, block, paragraph, word, and break information.
Headers
Content-Type*
String
multipart/form-data
Request Body
operation
String
operation to be performed: document_text_detection
image
File
image to be processed
Detect Labels In Images (label_detection)
label_detection)
Label Detection
POST
https://api.autogon.ai/api/v1/services/vision-ai/
LABEL_DETECTION
can identify general objects, locations, activities, animal species, products, and more.
Headers
Content-Type
String
multipart/form-data
Request Body
image*
File
image to be processed
operation*
String
operation to be performed: label_detection
Detect Landmarks In Images (landmark_detection)
landmark_detection)
Landmark Detection
POST
https://api.autogon.ai/api/v1/services/vision-ai/
LANDMARK_DETECTION
detects popular natural and human-made structures within an image.
Headers
Content-Type*
String
multipart/form-data
Request Body
image*
File
image to be processed
operation*
String
operation to be performed: landmark_detection
Detect Logo In Image (logo_detection)
logo_detection)
Logo Detection
POST
https://api.autogon.ai/api/v1/services/vision-ai/
LOGO_DETECTION
detects popular product logos within an image.
Headers
Content-Type*
String
multipart/form-data
Request Body
image*
File
image to be processed
operation*
String
operation to be performed: logo_detection
Detect Web entities and pages (web_detection
) (Deprecated)
web_detection
) (Deprecated)Web Detection
POST
https://api.autogon.ai/api/v1/services/vision-ai/
WEB_DETECTION
detects the most likely owners of faces in an image and also Web references to an image.
Headers
Content-Type*
String
multipart/form-data
Request Body
image*
File
image to be processed
operation*
String
operation to be performed: web_detection
Detect Objects in Images (object_detection)
POST
https://api.autogon.ai/api/v1/services/vision-ai/
Headers
Content-Type*
String
multipart/form-data
Request Body
image*
File
image to be processed
operation*
String
Operation to be performed (object_detection)
Image Generation
Creates an image given a prompt.
POST
http://api.autogon.ai/services/image-generation/
Given a prompt and/or an input image, this endpoint will generate a new image.
Headers
Content-Type*
String
application/json
Request Body
prompt*
String
A text description of the desired image(s). The maximum length is 1000 characters.
output_size
String
The size of the generated images. Must be one of 256x256
, 512x512
, or 1024x1024, defaults to 512x512.
License Plate Detector
Detects license plate in a given image
POST
http://api.autogon.ai/services/license-plate-detection/
Headers
Content-Type*
String
application/json
Request Body
image_urls*
Array
An array of image urls with images of license plates
confidence_thresh
Float
Default: 0.5
overlap_thresh
Float
Default: 0.5
Motion Detection
Provides real time, accurate motion tracking for video streams
POST
http://api.autogon.ai/services/motion-detection/
Headers
Content-Type*
String
application/json
Request Body
video*
File
video file to detect motion
Stable Diffusion (Deprecated)
Generate Images from Text
POST
https://api.autogon.ai/api/v1/services/stable-diffusion/
Headers
Content-Type*
application/json
Request Body
text*
String
Short description of image to be generated
Image Captioning (Deprecated)
Analyzes an image and generates captions for images
POST
https://api.autogon.ai/api/v1/services/image-captioning/
Headers
Content-Type*
String
multipart/form-data
Request Body
image*
File
image to be captioned
image_url*
String
url of the image to be captioned (should be downloadable)
Document Question and Answering (Deprecated)
Leverage AI models capable of answering questions based on the content of a given document.
POST
https://api.autogon.ai/api/v1/services/document-qa/
Headers
Content-Type*
String
multipart/form-data
Request Body
question*
String
document_url
String
document
File
Visual Question and Answering (Deprecated)
Generate accurate answers about an image based on the visual content of the image
POST
https://api.autogon.ai/api/v1/services/vilt-vq/
Headers
Content-Type*
String
multipart/form-data
Request Body
text*
String
image
File
image_url
String
Last updated