Azure Media Services for transcoding and delivering audio

I have a common use case where I want to do the following:
Upload an audio file (WAV/MP3).
Transcode it to a 128k or 192k MP3.
Store the audio asset.
Allow the audio asset to be streamed.
Support streaming actions such as play, pause, and seek.
The documentation for Azure Media Services suggests it might support this, but I am not sure; it seems to focus on video content. Does anyone have experience with this?

You can manage audio and encode audio-only assets with Azure Media Services.
WAV is a supported input format/container for an input asset. For the full list of supported formats, check the following link:
https://azure.microsoft.com/en-us/documentation/articles/media-services-media-encoder-standard-formats/
Check https://github.com/Azure/azure-content/blob/master/articles/media-services/media-services-custom-mes-presets-with-dotnet.md#audio_only to see the audio-only preset options you can use to encode an audio-only asset.
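For orientation, the audio-only preset from that link is a small JSON document. Below is a sketch of it as a Python dict, modeled on the documented "AAC Good Quality Audio" sample; the values are illustrative, and note that the documented audio-only presets output AAC rather than MP3, so verify against the current docs if MP3 output is a hard requirement:

```python
import json

# Sketch of an audio-only Media Encoder Standard preset, modeled on the
# documented "AAC Good Quality Audio" sample. Values are illustrative.
audio_only_preset = {
    "Version": 1.0,
    "Codecs": [
        {
            "Type": "AACAudio",
            "Profile": "AACLC",
            "Channels": 2,
            "SamplingRate": 48000,
            "Bitrate": 192,  # kbps; drop to 128 for the smaller rendition
        }
    ],
    "Outputs": [
        {
            "FileName": "{Basename}_AAC_{AudioBitrate}.mp4",
            "Format": {"Type": "MP4Format"},
        }
    ],
}

# The preset is passed to the encoding task as a JSON string.
print(json.dumps(audio_only_preset, indent=2))
```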

Related

When video or audio is played from a URI, is it streamed or downloaded fully and then played?

I have a content creation site I am building, and I'm confused about audio and video.
If a content creator's audio or video is stored in S3 and I want to display their file, will the HTML video or audio player stream the media, or will it download it fully and then play it?
I ask because the video or audio could be significantly long, like 2 hours, for example. I need to know how to handle that use case.
Lastly, what file type is most widely supported for viewing on web pages? It seems like MPEG-4 is the best bet. Is that true?
Most video player clients and browsers will attempt to stream the video if they can.
For an MP4 video file hosted on a server, as long as the header is at the start of the file and the server accepts range requests, the player downloads the video in chunks and starts playing as soon as it has enough data to decode the first frames.
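You can check whether a given server supports range requests by inspecting the response to a ranged request. A minimal Python sketch (the URL is a placeholder, and it assumes the requests library is installed):

```python
import requests

# Hypothetical URL - substitute the address of your hosted MP4.
url = "https://example.com/video.mp4"

# Ask for just the first kilobyte; a server that honors range
# requests answers 206 Partial Content instead of 200 OK.
resp = requests.get(url, headers={"Range": "bytes=0-1023"}, stream=True)

print("Status:", resp.status_code)  # 206 => ranges supported
print("Accept-Ranges:", resp.headers.get("Accept-Ranges"))  # usually "bytes"
print("Content-Range:", resp.headers.get("Content-Range"))
```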
For more professional streaming services, they will generally use an adaptive bit rate streaming protocol like DASH or HLS (see this answer: https://stackoverflow.com/a/42365034/334402) and again the video will be streamed in chunks, or segments, and will start playing while it is streaming.
To answer your last question, you need to be aware that the raw video is encoded (e.g. H.264, VP9, etc.) and the video, audio, subtitle, etc. tracks are stored in a video container (e.g. MP4, WebM, etc.).
The most common format is probably H.264 encoding in an MP4 container at this time.
The particular profile for H.264 can also matter depending on the device - baseline is probably the most widely supported profile at this time. You can find examples of media support for different devices online, e.g. for Android: https://developer.android.com/guide/topics/media/media-formats
Mick's answer is spot on. I'll just add that MP4 (with H.264 encoding) will work in just about every browser out there.
The issue with MP4 files (especially with a 2-hour-long movie) isn't so much the seeking and streaming. If your creator uploads a 4K video, that's what you'll deliver to everyone (even mobile phones). HLS streaming, on the other hand, has adaptive bitrates: the video adapts to both the screen and the available network speed. You'll get better playback results with less buffering (and, if you're using AWS, a lot less data egress) with video streaming; a sketch of building such a ladder follows below.
(There are a bunch of APIs and services that can help you do this, including api.video (where I work), Mux, and others.)
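To make the adaptive-bitrate idea concrete, here is a minimal sketch that builds a two-rendition HLS ladder by driving ffmpeg from Python. The file names, bitrates, and resolutions are illustrative, and it assumes an ffmpeg binary is on your PATH:

```python
import subprocess
from pathlib import Path

SOURCE = "input.mp4"  # placeholder source file

# Illustrative ladder: (name, resolution, video bitrate, master-playlist bandwidth).
RENDITIONS = [
    ("480p", "854x480", "1400k", 1_600_000),
    ("240p", "426x240", "400k", 500_000),
]

for name, resolution, v_bitrate, _ in RENDITIONS:
    out_dir = Path(name)
    out_dir.mkdir(exist_ok=True)
    # Re-encode to H.264/AAC and cut into 6-second HLS segments.
    subprocess.run([
        "ffmpeg", "-y", "-i", SOURCE,
        "-c:v", "libx264", "-b:v", v_bitrate, "-s", resolution,
        "-c:a", "aac", "-b:a", "128k",
        "-hls_time", "6", "-hls_playlist_type", "vod",
        str(out_dir / "index.m3u8"),
    ], check=True)

# Master playlist listing both renditions; the player measures throughput
# and switches between them as network conditions change.
lines = ["#EXTM3U"]
for name, resolution, _, bandwidth in RENDITIONS:
    lines.append(f"#EXT-X-STREAM-INF:BANDWIDTH={bandwidth},RESOLUTION={resolution}")
    lines.append(f"{name}/index.m3u8")
Path("master.m3u8").write_text("\n".join(lines) + "\n")
```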

Is there a way to add 480p and 240p layers to the Content Adaptive Multiple Bitrate MP4 preset in Azure Media Services?

I am using Azure Media Services for video streaming but have a problem I can't solve.
Is there a way to add 480p and 240p layers to the Content Adaptive Multiple Bitrate MP4 preset?
I want the Azure encoder to decide my video bitrates but still be able to customize the video layers.
After discussion, we confirmed that the current implementation of the Content Adaptive Multiple Bitrate MP4 preset does not permit further customization of the resolutions.
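If explicit 480p and 240p renditions are a hard requirement, the usual workaround is to skip the content-adaptive preset and supply a custom multi-layer preset instead. A hedged sketch of what such a preset might look like, expressed as a Python dict; the field names follow the documented custom MES preset schema, but the bitrates and buffer settings are illustrative only and should be verified against the current documentation:

```python
# Hedged sketch of a custom Media Encoder Standard preset that pins explicit
# 480p and 240p layers instead of letting the content-adaptive preset choose.
# Field names follow the documented custom-preset schema; the bitrate and
# buffer values here are illustrative, not verified recommendations.
custom_preset = {
    "Version": 1.0,
    "Codecs": [
        {
            "Type": "H264Video",
            "KeyFrameInterval": "00:00:02",
            "H264Layers": [
                {"Profile": "Auto", "Level": "auto", "Bitrate": 1000,
                 "MaxBitrate": 1000, "BufferWindow": "00:00:05",
                 "Width": "854", "Height": "480", "FrameRate": "0/1"},
                {"Profile": "Auto", "Level": "auto", "Bitrate": 400,
                 "MaxBitrate": 400, "BufferWindow": "00:00:05",
                 "Width": "426", "Height": "240", "FrameRate": "0/1"},
            ],
        },
        {"Type": "AACAudio", "Profile": "AACLC", "Channels": 2,
         "SamplingRate": 48000, "Bitrate": 128},
    ],
    "Outputs": [
        {"FileName": "{Basename}_{Width}x{Height}_{VideoBitrate}.mp4",
         "Format": {"Type": "MP4Format"}},
    ],
}
```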

Processing an audio file into a transcription using the Azure Cognitive Services SDK | Python 3.x

Microsoft has a library for transcription, but the official examples only show microphone input. I want to understand how to transcribe an audio file in WAV format.
Based on the official documentation, it only illustrates how to recognize speech from microphone input.
I think you could try to use a custom audio stream; please refer to this article.
The Speech SDK's Audio Input Stream API provides a way to stream audio streams into the recognizers instead of using either the microphone or the input file APIs.
However, you need to make sure your format is supported by Azure Speech Service, as stated in the document:
Currently, only the following configuration is supported: audio samples in PCM format, one channel, 16000 samples per second, 32000 bytes per second, a block alignment of two bytes (16 bits including padding for a sample), and 16 bits per sample.
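For reference, the Python Speech SDK can also read a WAV file directly via AudioConfig, which is the simplest path for this use case. A minimal sketch; the subscription key, region, and file name are placeholders:

```python
import azure.cognitiveservices.speech as speechsdk

# Placeholders - substitute your own subscription key, region, and file.
speech_config = speechsdk.SpeechConfig(subscription="YOUR_KEY", region="YOUR_REGION")
audio_config = speechsdk.audio.AudioConfig(filename="recording.wav")

recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config,
                                        audio_config=audio_config)

# Recognize a single utterance; for long files, use continuous recognition
# (start_continuous_recognition) instead.
result = recognizer.recognize_once()
if result.reason == speechsdk.ResultReason.RecognizedSpeech:
    print(result.text)
else:
    print("Recognition failed:", result.reason)
```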

Real-Time Audio Encoding With Azure Media Services

Is it possible to get real-time audio encoding using Azure Media Services? We have an ASP.NET MVC C# site where we want to allow our users to upload an audio file and then immediately play that audio file back using a standard HTML5 audio tag.
I know I can upload the audio asset to Azure and then ask it to encode the file into an MP3 so that it can be played using the audio tag, but there may be too much of a delay in that process. Is there a way to upload the asset and then ask Azure for an MP3 stream that it encodes in real time, so that I can play it to the user immediately after the upload completes?
If it cannot be done with Azure, is there a different service that offers that capability?
Currently, we do not provide a real-time transcoding option where the playback request triggers a real-time transcode.
An option for you may be to run ffmpeg directly in an Azure Function.
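A minimal sketch of that suggestion, using the Python programming model for Azure Functions with a blob trigger. The container names are placeholders, and it assumes an ffmpeg binary is deployed alongside the function app:

```python
import subprocess
import tempfile
from pathlib import Path

import azure.functions as func

app = func.FunctionApp()

# Fires when a new file lands in the hypothetical "uploads" container.
@app.blob_trigger(arg_name="blob", path="uploads/{name}",
                  connection="AzureWebJobsStorage")
def transcode_upload(blob: func.InputStream):
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "input.wav"
        dst = Path(tmp) / "output.mp3"
        src.write_bytes(blob.read())

        # Re-encode to 192 kbps MP3; assumes ffmpeg ships with the app.
        subprocess.run(
            ["ffmpeg", "-y", "-i", str(src),
             "-codec:a", "libmp3lame", "-b:a", "192k", str(dst)],
            check=True,
        )
        # Upload dst to an output container here (e.g. with azure-storage-blob).
```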

Do Windows Azure live media encoders provide live transcoding?

I have a simple question - I want to stream live video + audio. I would like to use Windows Azure for that (mainly because it seems to provide HLS with AES protection, which I have not encountered in open-source solutions, and per-streaming-user pricing that is clear to managers). I am troubled by the following quote:
Currently, Media Services does not provide a live transcoding service. You can use one of the following third-party live encoders that output RTMP or Smooth Streaming formats: Elemental, Envivio, Cisco, and RGB encoders output Smooth Streaming; Adobe Flash Live, Wirecast, and Teradek encoders output RTMP.
And a few lines later:
You can deliver your live stream in any of the following formats: Smooth Streaming, DASH and HLS. When doing live streaming, HLS is packaged dynamically and the default HLS packaging ratio is 3 Smooth fragments to 1 HLS segment (3:1).
...
Configure a live transcoder.
Every time you reconfigure the transcoder, call the Reset method on the channel.
So no transcoding is provided, yet I am supposed to set up a transcoder... What? How?
In FFmpeg there are two types of transcoding:
from one encoded data format to another (say, raw PCM data to encoded MP3 frames)
from one frame/packet container format to another (say, MP4 frames of already-encoded audio/video to FLV frames with the same encoded data in them)
Are they trying to tell me that they provide frame repacking from RTMP to HLS, but no live encoding into another compression type (say, from Speex audio to AAC)?
As I answered on your other post, you can use a tool like Wirecast 6 to encode your live stream and push it to the Azure ingest URL. We will give you a publish URL that can dynamically package the content into HLS, Smooth Streaming and DASH.
For more information, please refer to this post: http://azure.microsoft.com/blog/2014/09/10/getting-started-with-live-streaming-using-the-azure-management-portal/
Yes. The second type of transcoding you describe is better named transpackaging, because no video encoding is done.
Transcoding is not provided. Transpackaging is provided.
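In ffmpeg terms, the distinction looks like this (a hedged sketch; the file names are placeholders):

```python
import subprocess

# Transcoding: decode and re-encode the streams (e.g. Speex audio to AAC).
# CPU-intensive, and the compression format changes.
subprocess.run(
    ["ffmpeg", "-i", "input.flv", "-c:v", "libx264", "-c:a", "aac",
     "transcoded.mp4"],
    check=True,
)

# Transpackaging (remuxing): copy the already-encoded streams into a new
# container without touching the compressed data. Cheap; format unchanged.
subprocess.run(
    ["ffmpeg", "-i", "input.flv", "-c", "copy", "repackaged.mp4"],
    check=True,
)
```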
