How to download subtitles from pluzz.fr - text

Is there a way to download subtitles/closed captions from pluzz fr or the beta version, http://pluzz.francetv.fr, please?
Here's an example video that comes with closed captioning (Tout le monde veut prendre sa place): http://pluzz.francetv.fr/videos/tout_le_monde_veut_prendre_sa_place_,69573658.html
By the way, you'll need a French IP to watch the video.
Regards.

I've been looking into this as well and have made some progress. To update your example, (to a video that is available for the next 6 days) I've used http://pluzz.francetv.fr/videos/tout_le_monde_veut_prendre_sa_place_,70697467.html
Using the ID number at the end of that URL, go to this URL to get the XML-like info file about that program i.e. www.pluzz.fr/appftv/webservices/video/getInfosOeuvre.php?mode=zeri&id-diffusion=70697467
If you look in that resulting file, you will see various into about the program including at the end: incrust type="accessibilite". If subtitles are available, this is followed by a relative URL pointing to the subtitle file. For this example, you can see:
<incrust type="accessibilite">/appftv/webservices/video/getFichierSmi.php?smi=france2/70697467.smi&source=azad</incrust>
Prefix that relative link with the pluzz website, and the result is: http://www.pluzz.fr/appftv/webservices/video/getFichierSmi.php?smi=france2/70697467.smi&source=azad
which which provides you with the subtitles in SAMI format (file extension .smi). This is an XML like subtitle format, but I haven't (yet!) found any player that can correctly display them nor found an easy way to convert them into a clean SRT format. However, I do think it would be a relatively simple task (for someone with some programming skills) to convert it into SRT format, but that isn't me... Perhaps the OP has those skills and can take it from here??
PS. Also of interest is the fact that you don't need a French IP to access the info file or the subtitles file, just the video file.

Related

Azure Media Services: Provide custom file names for the asset files

I'm encoding a video file using the built-in adaptive streaming transform. Once the file is successfully processed, an asset container is created with the below files:
Is it possible to provide custom file names at the time a job is created? It seems that the default behavior is to take a certain number of characters from the original file name and prepend them in the above file names. If possible, I'd like to configure this behavior.
P.S. I'm using the .NET SDK.
You can create a custom transform to output file names differently. On https://learn.microsoft.com/en-us/rest/api/media/transforms/createorupdate#definitions search for the Mp4Format section. In that you can specify the filenamePattern with certain macros like {Bitrate} and {Codec}.
See https://learn.microsoft.com/en-us/azure/media-services/latest/custom-preset-cli-howto for an example custom transform and the process by which to create it in Media Services.
I use the macros on my jobs, they work ok. I have a process that takes 3 videos (an intro section, the actual content, and the outro section) and encodes them as one single video. The issue I have with the macros is that it uses the file name of the first video in the inputs. So it ends up using the file name of the intro video which is a generic name. They need to have a way where we can have a little more control.
I suppose I could copy/rename the intro video to a desired name before I encode and it would pick it up, but that seems to be a little bit of overkill.
The Macros are good, but they could use some enhancements I think.

A Study on the Modification of PDF in nodejs

Project Environment
The environment we are currently developing is using Windows 10. nodejs 10.16.0, express web framework. The actual environment being deployed is the Linux Ubuntu server and the rest is the same.
What technology do you want to implement?
The technology that I want to implement is the information that I entered when I joined the membership. For example, I want to automatically put it in the input text box using my name, age, address, phone number, etc. so that the user only needs to fill in the remaining information in the PDF. (PDF is on some of the webpages.)
If all the information is entered, the PDF is saved and the document is sent to another vendor, which is the end.
Current Problems
We looked at about four days for PDFs, and we tried to create PDFs when we implemented the outline, structure, and code, just like it was on this site at https://web.archive.org/web/20141010035745/http://gnupdf.org/Introduction_to_PDF
However, most PDFs seem to be compressed into flatDecode rather than this simple. So I also looked at Data extraction from /Filter /FlateDecode PDF stream in PHP and tried to decompress it using QPDF.
Unzip it for now.Well, I thought it would be easy to find out the difference compared to the PDF without Kim after putting it in the first name.
However, there is too much difference even though only three characters are added... And the PDF structure itself is more difficult and complex to proceed with.
Note : https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf (PDF official document in English)
Is there a way to solve the problem now?
It sounds like you want to create a PDF from scratch and possibly extract data from it and you are finding this a more difficult prospect than you first imagined.
Check out my answer here on why PDF creation and reading is non-trivial and why you should reach for a tool you help you do this:
https://stackoverflow.com/a/53357682/1669243

Easily differentiate video files from image files in Node

I'm building a project where people can upload files, I would like to then display those files in a browser where people can interact with them (vote, comment etc)
However, this means I need to programatically build the html depending on the format of the video or image. Is there a way to feed a file (or filename) into a library, and determine whether I need to display it in a video element or an image element? Even a list of video formats vs image formats would help but I haven't seen anything in regards to that.
No module can reliably determine the file type. The user could either change the extension or even the magic number of the file to obfuscate it. The only reliable way it to try to pass file to some image / video transcoder to let it decide or error out if the format is invalid. This way you know you are working with known formats since all files are transcoded to your specific extensions. That could be mp4 or png. I recommend using handbrake for videos and sharp for images. Leaving the NPM links down below:
https://www.npmjs.com/package/handbrake-js
https://www.npmjs.com/package/sharp

Need a way to write headers on a wav file generated by sox

I'm using sox to convert some mp3 files to wav for a project. The problem is that the software that plays the files does not have the media name for the element it is playing. I can't seem to find a win32 cli tool to read the header of the wav file and write what I need to it.
Sox will read the header but it's not showing the title of the media element that was inserted with the old software I used. I just couldnt automate it or I would have used it instead.
I have determined the info is written in either CART CHUNK section or just to the file headers. I can't figure out to write my own data there.
The way I was able to do this was with this project: https://github.com/JamesHeinrich/getID3

how can I add tracking meta information to jpg files?

I have a client that wants to be able to find their proprietary images other places on the internet if someone steals them.
Is there a way to add meta information that we can track for this purpose inside a jpg file?
My answer was getting too long for comment so posting it as answer.
you can add an entry in EXIF IFD(metadata in binary) of jpeg image. It is not visible in image as it is. There are many EXIF editor/reader tools which will allow you to edit/read it. for example you can add a particular string in makernote of jpeg.
As I said, there are tools available in market, this also means that people can edit also reset the complete EXIF metadata which will erase your data too....
To get around this you can add your copyrighted IFD entry which only you can understand. For this you have to edit some EXIF tool and tweak according to your need.
But while adding your entry in JPEG metadata please take care that you adhere to EXIF/TIFF specifications so that image is not corrupted and readable in all platforms.
Here are some links to help you understand EXIF/TIFF -
http://gvsoft.homedns.org/exif/exif-explanation.html
and
http://www.exif.org/Exif2-2.PDF

Resources