aster.cloud aster.cloud
  • /
  • Platforms
    • Public Cloud
    • On-Premise
    • Hybrid Cloud
    • Data
  • Architecture
    • Design
    • Solutions
    • Enterprise
  • Engineering
    • Automation
    • Software Engineering
    • Project Management
    • DevOps
  • Programming
    • Learning
  • Tools
  • About
  • /
  • Platforms
    • Public Cloud
    • On-Premise
    • Hybrid Cloud
    • Data
  • Architecture
    • Design
    • Solutions
    • Enterprise
  • Engineering
    • Automation
    • Software Engineering
    • Project Management
    • DevOps
  • Programming
    • Learning
  • Tools
  • About
aster.cloud aster.cloud
  • /
  • Platforms
    • Public Cloud
    • On-Premise
    • Hybrid Cloud
    • Data
  • Architecture
    • Design
    • Solutions
    • Enterprise
  • Engineering
    • Automation
    • Software Engineering
    • Project Management
    • DevOps
  • Programming
    • Learning
  • Tools
  • About
  • Technology

Speech AI Year In Review

  • aster.cloud
  • January 16, 2023
  • 3 minute read

Almost anywhere you looked, AI-based speech technologies continued to blossom in 2022, from increased interest measured in Google Trends, to surprising medical advances that suggest speech patterns can help detect some illnesses, to the variety of digital services and devices that users control with their voices.

At Google Cloud, we spent 2022 making the best of Google’s speech AI and natural language technologies available to our customers, who are leveraging these technologies for use cases that range from robots that can help foster healthy childhood development, to customer service improvements based on data from phone calls, voicemails, and other speech interactions.We expect speech AI technologies and related advancements to significantly impact business and the world in coming years, as Andrew Moore, Google Cloud’s General Manager for Cloud AI & Industry Solutions has explored. To make sure you head into 2023 with all the latest news, below are some of our most noteworthy Speech AI announcements from the last year:


Partner with aster.cloud
for your next big idea.
Let us know here.



From our partners:

CITI.IO :: Business. Institutions. Society. Global Political Economy.
CYBERPOGO.COM :: For the Arts, Sciences, and Technology.
DADAHACKS.COM :: Parenting For The Rest Of Us.
ZEDISTA.COM :: Entertainment. Sports. Culture. Escape.
TAKUMAKU.COM :: For The Hearth And Home.
ASTER.CLOUD :: From The Cloud And Beyond.
LIWAIWAI.COM :: Intelligence, Inside and Outside.
GLOBALCLOUDPLATFORMS.COM :: For The World's Computing Needs.
FIREGULAMAN.COM :: For The Fire In The Belly Of The Coder.
ASTERCASTER.COM :: Supra Astra. Beyond The Stars.
BARTDAY.COM :: Prosperity For Everyone.

Visual interface for the Speech-to-Text (STT) API

In February, we announced a visual user interface for our STT API, which supports over 70 languages in 120 different local variants. The STT API lets developers convert speech into text by harnessing Google’s years of research in automatic speech recognition and transcription technology—and with the visual interface, the API is that much more intuitive, helping more developers to more easily tap this technology for their projects. We celebrated the fifth anniversary of this API in April, noting that the API processes over 1 billion spoken minutes of speech each month, enough to transcribe all U.S. Presidential inauguration speeches in history over 1 million times.

Read More  How Wayfair Says Yes With BigQuery—Without Breaking The Bank

Support for custom voices in the Text-to-Speech (TTS) API

In March, we announced the general availability of Custom Voice in our TTS API, which lets customers create natural, human-like speech from text. Custom Voice lets customers train voice models with their own audio recordings, so they can offer users unique experiences. Customers simply submit audio recordings directly in the TTS API, which includes guidance to ensure high-quality models are created.

Improved STT API models

In April, we launched our newest models for the STT API, based on a new approach that uses a single neural network — as opposed to separate models for acoustic, pronunciation, and language training — and combines a transformer model with convolution layers. The result is significantly improved accuracy across dozens of the languages and dialects that the STT API supports. In December, we added the latest models for more languages including Bulgarian, Swedish, Romanian, Tamil, Bengali and more, bringing the total languages for latest models to over 45. See the full list here.

Large language models (LLMs) for the Natural Language (NL) API

In the fall, we updated the NL API with a new model for Content Classification based on Google’s groundbreaking research on LLMs, which includes projects like LaMDA, PaLM and T5. Thanks to both the integration of cutting-edge language modeling approaches and an updated and expanded training data set, Content Classification supports over 1,000 labels and 11 languages: Chinese, French, German, Italian, Japanese, Korean, Portuguese, Russian, Spanish, and Dutch.

Text-to-Speech Neural2

At Google Cloud Next ‘22, we announced the availability of our next generation of TTS voices, Neural2. These voices build on Google’s created PnG NAT technology, which we use to power our Custom Voice offering. Neural2 voices bring the same improvements customers see from PnG NAT in Custom Voices to default voices. In December, we made Neural2 generally available and now have default voices available in: English, French, Spanish, Italian, German, Portuguese, and Japanese. See the full list here.

Read More  From Data Chaos To Data-Driven: How Dedicated Data Teams Can Help Edtechs Influence The Future Of Education

Speech services even without a network connection via Speech On-Device

At Google Cloud Next ‘22, we made Speech On-Device generally available, eliminating the frustration of trying to access voice services without a network connection, such as when driving far from coverage or when network outages occur. Toyota is already making use of Speech On-Device as Ryan Wheeler — Vice President, Machine Learning at Toyota Connected North America — discussed in a Google Cloud Next ‘22 session.

We look forward to continuing to bring Google’s most innovative and impactful research to our cloud services in 2023—but in the meantime, to learn more about using Google Cloud speech AI products, check out this guide, these codelabs, and our Responsible AI page.

 

By: Keelin McDonell (Product Manager, Cloud AI and Industry Solutions)
Source: Google Cloud Blog


For enquiries, product placements, sponsorships, and collaborations, connect with us at [email protected]. We'd love to hear from you!

Our humans need coffee too! Your support is highly appreciated, thank you!

aster.cloud

Related Topics
  • Artificial Intelligence
  • Google Cloud
You May Also Like
View Post
  • Gears
  • Technology

Samsung Art Store Brings Art Basel to Homes Worldwide With New Curated Collection

  • June 15, 2026
View Post
  • Technology

The consequences of relying on AI for accurate news

  • June 10, 2026
View Post
  • Gears
  • Technology

WWDC26: Apple unveils next generation of Apple Intelligence, Siri AI, powerful parental controls, and an expansive set of software improvements

  • June 8, 2026
View Post
  • Technology

IBM and Google Cloud Announce Strategic Partnership to Scale AI with Human Expertise and AI‑Powered Delivery

  • June 4, 2026
View Post
  • Technology

Banks race to patch new cyber vulnerabilities, and other cybersecurity news

  • May 25, 2026
pope-leo-xiv-cq5dam-1500.844
View Post
  • Technology

Pope Leo XIV to Publish First Encyclical on Artificial Intelligence and Human Dignity on 25 May

  • May 22, 2026
View Post
  • Technology

Portfolio to Clients, and is Strengthened by Ongoing Project Glasswing Work

  • May 20, 2026
reMarkable Paper Pure
View Post
  • Gears
  • Technology

Everything The reMarkable Paper Pure Actually Does

  • May 14, 2026

Stay Connected!
LATEST
  • 1
    Expectations vs. Reality: The AI We Thought We’d Have in 10 Years
    • June 19, 2026
  • digital-nomad-freelancer-worker-2151205464 2
    One paperwork problem – Get your Digital Nomad Visa employment documents fast from UK, EU or Singapore
    • June 16, 2026
  • 3
    Samsung Art Store Brings Art Basel to Homes Worldwide With New Curated Collection
    • June 15, 2026
  • 4
    You Do Not Need to Invest in the IPO of SpaceX, Anthropic, and OpenAI
    • June 10, 2026
  • 5
    The consequences of relying on AI for accurate news
    • June 10, 2026
  • 6
    Connecting AI agents with unstructured data using Google Cloud Storage MCP Servers
    • June 10, 2026
  • 7
    WWDC26: Apple unveils next generation of Apple Intelligence, Siri AI, powerful parental controls, and an expansive set of software improvements
    • June 8, 2026
  • 8
    IBM and Google Cloud Announce Strategic Partnership to Scale AI with Human Expertise and AI‑Powered Delivery
    • June 4, 2026
  • Data center 9
    Data Sovereignty in Spain. It’s Not Just About the Law, It’s About Efficiency
    • June 3, 2026
  • 10
    Ink vs Pixels. What you miss versus what you are actually missing.
    • June 1, 2026
about
Hello World!

We are aster.cloud. We’re created by programmers for programmers.

Our site aims to provide guides, programming tips, reviews, and interesting materials for tech people and those who want to learn in general.

We would like to hear from you.

If you have any feedback, enquiries, or sponsorship request, kindly reach out to us at:

[email protected]
Most Popular
  • 1
    Banks race to patch new cyber vulnerabilities, and other cybersecurity news
    • May 25, 2026
  • pope-leo-xiv-cq5dam-1500.844 2
    Pope Leo XIV to Publish First Encyclical on Artificial Intelligence and Human Dignity on 25 May
    • May 22, 2026
  • 3
    Portfolio to Clients, and is Strengthened by Ongoing Project Glasswing Work
    • May 20, 2026
  • reMarkable Paper Pure 4
    Everything The reMarkable Paper Pure Actually Does
    • May 14, 2026
  • 5
    Scaling cloud and AI: Microsoft Azure’s commitment to Europe’s digital future
    • May 11, 2026
  • /
  • Technology
  • Tools
  • About
  • Contact Us

Input your search keywords and press Enter.