Cloud Vision API documentation

Cloud Vision API allows developers to easily integrate vision detection features within applications, including image labeling, face and landmark detection, optical character recognition (OCR), and tagging of explicit content.

  • Get access to Gemini 2.0 Flash Thinking
  • Free monthly usage of popular products, including AI APIs and BigQuery
  • No automatic charges, no commitment
View free product offers

Keep exploring with 20+ always-free products

Access 20+ free products for common use cases, including AI APIs, VMs, data warehouses, and more.

Explore self-paced training from Google Cloud Skills Boost, use cases, reference architectures, and code samples with examples of how to use and connect Google Cloud services.
training
Training and tutorials

Learn how to perform optical character recognition (OCR) on Google Cloud Platform. This tutorial demonstrates how to upload image files to Google Cloud Storage, extract text from the images using the Google Cloud Vision API, translate the text using the Google Cloud Translation API, and save your translations back to Cloud Storage.

training
Training and tutorials

Quickly deploy your first function without any local setup.

training
Training and tutorials

This one-day instructor-led class introduces participants to the big data and machine learning capabilities of Google Cloud. It provides a quick overview of Google Cloud and a deeper dive into the data processing capabilities.

training
Training and tutorials

React to Cloud Storage changes with a function that processes an image using the Vision API to extract text and then pass it to other services.

training
Training and tutorials

Automatically classify data uploaded to Cloud Storage using Pub/Sub, Cloud Functions, and the Data Loss Prevention API.

code sample
Code Samples

Run offline (asynchronous) detection services and annotation of a large batch of image files using any Vision feature type.

Java Node.js Python Ruby

code sample
Code Samples

Set a specific region to store and process resources used for an Optical Character Recognition (OCR) request.

C# Go Java Node.js PHP Python Ruby

code sample
Code Samples

Demonstrates using the Google Cloud Vision API and ImageMagick to detect and blur offensive images that get uploaded to a Cloud Storage bucket.

Node.js Python Go Java

code sample
Code Samples

Use Vision API, Translation API, Text-to-Speech API to detect text in an image, personalize translations, and generate synthetic speech from the translated text.

Python

code sample
Code Samples

Use Vision API to identify text in an image, and then annotate an image based on the text that is detected.

Python

Related videos

Create an account to evaluate how our products perform in real-world scenarios.
New customers also get $300 in free credits to run, test, and deploy workloads.