Cloud Vision API documentation
Cloud Vision API allows developers to easily integrate vision detection features within applications, including image labeling, face and landmark detection, optical character recognition (OCR), and tagging of explicit content.
Start your proof of concept with $300 in free credit
- Get access to Gemini 2.0 Flash Thinking
- Free monthly usage of popular products, including AI APIs and BigQuery
- No automatic charges, no commitment
Keep exploring with 20+ always-free products
Access 20+ free products for common use cases, including AI APIs, VMs, data warehouses, and more.
Documentation resources
Reference
Resources
Related resources
OCR tutorial
Learn how to perform optical character recognition (OCR) on Google Cloud Platform. This tutorial demonstrates how to upload image files to Google Cloud Storage, extract text from the images using the Google Cloud Vision API, translate the text using the Google Cloud Translation API, and save your translations back to Cloud Storage.
Create a simple Hello, World! function in the console
Quickly deploy your first function without any local setup.
Big data and ML fundamentals
This one-day instructor-led class introduces participants to the big data and machine learning capabilities of Google Cloud. It provides a quick overview of Google Cloud and a deeper dive into the data processing capabilities.
Detect text in images by connecting Functions, Storage, Vision API, Pub/Sub, and the Translation API
React to Cloud Storage changes with a function that processes an image using the Vision API to extract text and then pass it to other services.
Automated Classification of Data Uploaded to Cloud Storage with the DLP API and Cloud Functions
Automatically classify data uploaded to Cloud Storage using Pub/Sub, Cloud Functions, and the Data Loss Prevention API.
Annotating multiple images in a single request and storing output in Cloud Storage
Run offline (asynchronous) detection services and annotation of a large batch of image files using any Vision feature type.
Setting a storage and processing location for OCR requests
Set a specific region to store and process resources used for an Optical Character Recognition (OCR) request.
Detecting and blurring offensive image content
Demonstrates using the Google Cloud Vision API and ImageMagick to detect and blur offensive images that get uploaded to a Cloud Storage bucket.
Translating and speaking text from a photo with glossaries (Advanced)
Use Vision API, Translation API, Text-to-Speech API to detect text in an image, personalize translations, and generate synthetic speech from the translated text.
Detect text in an image (OCR) and draw a border around the found text
Use Vision API to identify text in an image, and then annotate an image based on the text that is detected.
Related videos
Try Cloud Vision API for yourself
New customers also get $300 in free credits to run, test, and deploy workloads.