Vision AI
Fully managed production environment to create your own computer vision applications.
Benefits
Accelerate Time-to-Value: Effortlessly construct, deploy, and oversee computer vision applications tailored to your distinct business requirements. Leverage pre-trained APIs, AutoML, and custom models to streamline development and reduce complexity.
Versatility for Diverse Needs: Address a range of applications and skill levels. Choose between plug-and-play analytics through APIs, custom machine learning models, or comprehensive end-to-end development environments, all available within our vision portfolio.
Assured quality from the leader in vision of: Benefit from investments in vision across our portfolio. Vision offerings have received the highest ratings from several analyst firms.
Detect Text In Images (text_detection)
text_detection)
Text Detection (Image)
POST
https://api.autogon.ai/api/v1/services/vision-ai/
TEXT_DETECTION
detects and extracts text from any image. For example, a photograph might contain a street sign or traffic sign. The JSON includes the entire extracted string, as well as individual words, and their bounding boxes.
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
operation* | String | operation to be performed: |
image* | File | image file to be processed |
Detect Text In Documents (document_text_detection
)
document_text_detection
) Text Detection (Documents)
POST
https://api.autogon.ai/api/v1/services/vision-ai/
DOCUMENT_TEXT_DETECTION
also extracts text from an image, but the response is optimized for dense text and documents. The JSON includes page, block, paragraph, word, and break information.
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
operation | String | operation to be performed: |
image | File | image to be processed |
Detect Labels In Images (label_detection)
label_detection)
Label Detection
POST
https://api.autogon.ai/api/v1/services/vision-ai/
LABEL_DETECTION
can identify general objects, locations, activities, animal species, products, and more.
Headers
Name | Type | Description |
---|---|---|
Content-Type | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
image* | File | image to be processed |
operation* | String | operation to be performed: |
Detect Landmarks In Images (landmark_detection)
landmark_detection)
Landmark Detection
POST
https://api.autogon.ai/api/v1/services/vision-ai/
LANDMARK_DETECTION
detects popular natural and human-made structures within an image.
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
image* | File | image to be processed |
operation* | String | operation to be performed: |
Detect Logo In Image (logo_detection)
logo_detection)
Logo Detection
POST
https://api.autogon.ai/api/v1/services/vision-ai/
LOGO_DETECTION
detects popular product logos within an image.
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
image* | File | image to be processed |
operation* | String | operation to be performed: |
Detect Web entities and pages (web_detection
) (Deprecated)
web_detection
) (Deprecated)Web Detection
POST
https://api.autogon.ai/api/v1/services/vision-ai/
WEB_DETECTION
detects the most likely owners of faces in an image and also Web references to an image.
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
image* | File | image to be processed |
operation* | String | operation to be performed: |
Detect Objects in Images (object_detection)
POST
https://api.autogon.ai/api/v1/services/vision-ai/
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
image* | File | image to be processed |
operation* | String | Operation to be performed (object_detection) |
Image Generation
Creates an image given a prompt.
POST
http://api.autogon.ai/services/image-generation/
Given a prompt and/or an input image, this endpoint will generate a new image.
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | application/json |
Request Body
Name | Type | Description |
---|---|---|
prompt* | String | A text description of the desired image(s). The maximum length is 1000 characters. |
output_size | String | The size of the generated images. Must be one of |
License Plate Detector
Detects license plate in a given image
POST
http://api.autogon.ai/services/license-plate-detection/
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | application/json |
Request Body
Name | Type | Description |
---|---|---|
image_urls* | Array | An array of image urls with images of license plates |
confidence_thresh | Float | Default: 0.5 |
overlap_thresh | Float | Default: 0.5 |
Motion Detection
Provides real time, accurate motion tracking for video streams
POST
http://api.autogon.ai/services/motion-detection/
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | application/json |
Request Body
Name | Type | Description |
---|---|---|
video* | File | video file to detect motion |
Stable Diffusion (Deprecated)
Generate Images from Text
POST
https://api.autogon.ai/api/v1/services/stable-diffusion/
Headers
Name | Type | Description |
---|---|---|
Content-Type* | application/json |
Request Body
Name | Type | Description |
---|---|---|
text* | String | Short description of image to be generated |
Image Captioning (Deprecated)
Analyzes an image and generates captions for images
POST
https://api.autogon.ai/api/v1/services/image-captioning/
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
image* | File | image to be captioned |
image_url* | String | url of the image to be captioned (should be downloadable) |
Document Question and Answering (Deprecated)
Leverage AI models capable of answering questions based on the content of a given document.
POST
https://api.autogon.ai/api/v1/services/document-qa/
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
question* | String | |
document_url | String | |
document | File |
Visual Question and Answering (Deprecated)
Generate accurate answers about an image based on the visual content of the image
POST
https://api.autogon.ai/api/v1/services/vilt-vq/
Headers
Name | Type | Description |
---|---|---|
Content-Type* | String | multipart/form-data |
Request Body
Name | Type | Description |
---|---|---|
text* | String | |
image | File | |
image_url | String |
Last updated