グーグルが音声認識APIを公開して音声認識企業Nuanceに対抗 - ニュースな英語のホンヤクコンニャク

Google opens access to its speech recognition API, going head to head with Nuance

フリーソフトでつくる音声認識システム－パターン認識・機械学習の初歩から対話システムまで－

posted with amazlet at 16.03.23

荒木雅弘
森北出版
売り上げランキング: 15,442

Amazon.co.jpで詳細を見る

Google is planning to compete with Nuance and other voice recognition companies head on by opening up its speech recognition API to third-party developers. To attract developers, the app will be free at launch with pricing to be introduced at a later date.

グーグルは、自身の音声認識 APIを、サードパーティの開発者へ公開してNuanceや他の音声認識の企業と競い合う計画をしています。開発者をひきつけるために、無料でアプリを公開し、価格は後日表明されます。

We’d been hearing murmurs about this service developing for weeks now. The company formally announced the service today during its NEXT cloud user conference, where it also unveiled a raft of other machine learning developments and updates, most significantly a new machine learning platform.

このサービスが開発されていることについて、この数週間、チラホラと聞こえてきていました。同社はこのサービスをNEXTクラウドユーザーカンファレンスで正式に発表しました。そこでは、他に開発者向けの機械学習プラットフォームも発表されています。

murmurs - つぶやき

The Google Cloud Speech API, which will cover over 80 languages and will work with any application in real-time streaming or batch mode, will offer full set of APIs for applications to “see, hear and translate,” Google says. It is based on the same neural network tech that powers Google’s voice search in the Google app and voice typing in Google’s Keyboard. There are some other interesting features, such as working in noisy environments and in real-time.

この、グーグルクラウドスピーチAPIは、80の言語をサポートし、どのようなアプリケーションでも、リアルタイムに、または、バッチモードで動作します。「見る、聞く、翻訳する」というフルセットのAPIを提供しますと、グーグルは言っています。これは、グーグルAppの音声検索や、グーグルキーボードの音声入力と同じニューラルネットワークの技術に基づいています。雑音の多い環境でもリアルタイムに動作するといった、いくつかの興味深い機能があります。

Google’s move will have a large impact on the industry as a whole — and particularly on Nuance, the company long thought of as offering the best voice recognition capabilities in the business, and most certainly the biggest offering such services. A number of Nuance customers, including startups, could leave it in favor of Google’s technology, which not only offers an improved experience over current providers, but will also be made available at a lower cost.

グーグルの動きは、大きなインパクトを持っている。すべての産業において、特に、長い間最高の音声認識能力を提供していると考えられている Nuance や、このようなサービスを大きく提供しようとしている産業においても大きなインパクトを与える。 Nuanceの顧客数は、スタートアップも含めて、グーグルの技術を好んで、立ち去る可能性がある。現在の提供者での体験が改善されるだけではなく、費用が安くなるからだ。

To attract developer interest initially, the API will be completely free to use. Over time, the API will be paid, but likely have low-cost pricing tiers, we understand. Google may choose to raise those prices over time, after it becomes the dominant player in the industry.

最初に開発者の関心を引くため、APIは完全に無料で使用できることになるだろう。時間とともにAPIは有料になるが、しかし、低価格帯になると私たちは理解している。グーグルは時間とともに、この産業を支配的になったあと、価格を吊り上げることを選択するかもしれない。

Google has offered limited access to its voice technology in its products to date. Developers can make JavaScript calls to the Chrome API, for example, which are then routed to the speech recognition API. And Google announced a Voice Interaction API at Google I/O in 2015, which allows Android developers to add voice interactions to their apps. But Google had yet to open up access to the speech recognition API directly.

グーグルはこれまで、自身の製品の中で、その音声技術へのアクセスを制限していました。開発者は、JavaScriptで、例えば、このスピーチ識別APIへルーティングされる Chrome API を呼び出せます。そしてグーグルはボイス・インタラクションAPIを2015年のGoogle I/Oで発表しました。これは、Android開発者のアプリに、声による操作を追加できるようにするものです。しかし、グーグルはこのスピーチ識別APIへの直接アクセスをまだ公開していません。

The introduction of the speech API won’t only impact Nuance and other speech recognition providers, but is also being seen as an attack on Apple, whose virtual assistant Siri’s voice recognition capabilities pale in comparison to Google’s. It’s also yet to offer an API for developers to use the Siri tech in their own apps.

このスピーチAPIの紹介は、Nuanceだけでなく、他のスピーチ識別プロバイダーへもインパクトを与えるでしょう。しかし、バーチャルアシスタント Siriの音声認識能力が、グーグルとの比較で青ざめたAppleを襲った時にも見られました。開発者が彼らのアプリでSiriの技術を使用するためのAPIはまだ提供されていません。

There were hints that Google would be putting a greater emphasis on its voice technology and many use cases. For example, the company announced in February that it would allow Google Docs users to edit and format their documents by voice.

グーグルの音声技術に重点を置くだろうという情報がありました。例えば、同社は2月にGoogle Docsユーザーは、音声によってドキュメントを編集できるようにすると発表しました。