I am trying to use google speech to text api I am getting error: contentExceptions#media-stream - speech-to-text

I am trying to use google speech to text api I am getting error: Permission to use microphone is blocked. To change, go to chrome://settings/contentExceptions#media-stream
can any one help how to resolve this.

Related

Are Wavenet voices still supported through the Node JS Google Text-to-speech client?

Are Wavenet voices still supported through the Node JS Google Text-to-speech client? It used to work before on v.1.3.0 of the NodeJs client and now it is not. I am getting the following error now when I try to generate speech using a Wavenet voice:
Request contains an invalid argument
Regular voices work fine.
I work on Google Cloud Text-to-Speech.
Yes, WaveNet is supposed to work. Can you try to reproduce your synthesis request using a curl command and copy/paste your command payload here?

Can't seem to find a way to play mp3

I've been hours going around this problem and I still can't solve it. Basically I get data from a database and using google text to speech I transform it into an mp3; after that I upload it to the google cloud storage. From there I use Twilio API to play the mp3 file when making an outbound call; I know I need to have a url for this file but I am very inexperienced in this and when I create a VoiceResponse() I can't input it. I am doing this all through Python. Is it possible for me to play the mp3 in the outbound call?
Best

Dialogflow/ API.AI for Google Home Mini App

This is my very first question and seems to be easy but coding it is somehow difficult...
So, basically I've got an Agent that should just call a URL and get the content of the website. This should then be spoken out by the smart speaker.
I already have a webhook working but now I'm stuck at getting the web-content..
Would be awesome if someone could please help me out here.
Best regards
Getting the web content is not easy . you have to parse the content in a web page using program , a simple logic is to detect '<' and '>' and remove all strings in between using string functions
Best of luck with that

Python: adding comments to specific cell using Drive API

I am trying to automate some simple updating of a Google spreadsheet and I'm using the gspread library to do so. One task that is critical and not currently supported by gspread is the ability to add comments to a specific cell (there's an open issue for this and even a gist solution but I was getting a 404 error when trying to use it).
I know that the Google Drive API (v3) supports adding comments as described here, but I'm having issues with authenticating and could use some help.
What I have/know:
I have already setup the OAuth 2.0 and registered for the API through Google, as well as have the client_secret.json in my directory, but my knowledge of web requests and responses is limited so going through the Drive API documentation hardly makes sense. I know in order to create the comments I will have to make use of anchors and specify the cell location using column/row numbers.
What I'm stuck on:
When using the Google API Explorer, I'm getting a 400 error with the message: The 'fields' parameter is required for this method. How can I make the POST request using my authentication? I think from there I'd be able to actually add the comments myself.
I'm getting a 400 error with the message: The 'fields' parameter is required for this method
The error is asking for a property which you want returned (these properties are listed in Drive API files resource).
You can just place ' * ' to indicate you want it to return a complete response. That's the quick fix.

Unable to upload large file to Speech To Text IBM Watson

When I tried to upload large file (from observation is greater than 8MB, around 3-4mins), I got "request is aborted" error.
Anyone experiencing similar issue?
I am using C# and uses the HTTP REST Interface of Speech to Text using sessionless method.
This is the url I am hitting: https://stream.watsonplatform.net/speech-to-text/api/v1/recognize?timestamps=true≺ofanity_filter=false&max_alternatives=3&word_confidence=true&word_alternatives_threshold=0.7&model=en-US_NarrowbandModel&continuous=true∈activity_timeout=600
I tried both using "Transfer-Encoding: chunked" and without.
Please help. Thanks.
Split on chunks and upload separately. Or use other services like speechmatics or even open soruce implementation, they work with very large files.
you can push files/streams of up to 100MBs in a single request using Watson STT API. If you are using C# please take a look at the .NET SDK, that may be a good start for you: https://github.com/watson-developer-cloud/dotnet-standard-sdk. And, of course, nothing like getting started with a curl command to build confidence, see details here: https://www.ibm.com/watson/developercloud/doc/speech-to-text/getting-started.html

Resources