I have the usecase to read the text from the PAN Card. Ideally the application should have the screen to scan the PAN Card and the text should be extracted from there. The extracted texts will be auto populated on the further screens.
I have read about tesseract npm module, but still didn't have the clue where to start as there is no compete blogs available for this usecase over the internet. Also tried the npm module - okrabyte, this is not giving 100% result. Any guidance or help would be required.
I tried AWS Textract service as well. This is not helping to parse the PAN CARD as the extracted results were completely different.
You need to use OCR to achieve this. There are various options for doing this. Tesseract is open source. I hope this blog helps you get started with tesseract on nodejs.
You can use OCR apis from different cloud providers to achieve this as well. Example: Microsoft Cognitive Services Vision API, Abbyy Cloud, etc.
Also, improving the quality of your image helps in extracting text with higher accuracy. Personally, I've seen big difference between 200 dpi images vs 600 dpi images.
Hope this helps!
Related
Not entirely new to azure, but new to the Media Services available on azure. I am looking for suggestion on what azure components I should consider to build a solution to analyze video for certain conditions.
(e.g. 1) Presence of a human - Yes/No, 2) alert if no human presence detected for a certain number of minutes, 3) confirmation if identified human is wearing a uniform or not, etc. )
I have built a somewhat similar on-premise solution in the past using OpenCV & some open source ML libraries, not sure what azure services I can use if this will be running in Azure.
I can live stream this to azure and am not looking for an edge solution.
I looked up azure video indexer and it looks promising, but probably more tuned for audio analysis rather then image frame analysis.
suggestions would be appreciated.
Azure video indexer is optimized for files, not streams, but is capable of meeting the requirement since it detect faces and people (in advanced preset).
Regarding uniform or not, this is not supported in video indexer at the moment but ability to detect cloth color will come in the future.
By fragmenting the video, Azure Video Indexer provides a near live solution. It means there will be a few minutes delay, so it depends on how time-sensitive your requirements are.
Regarding your second question, it will be possible to customize a model to identify specific uniforms in a few months. When the bounding boxes of the uniforms match the bounding boxes of the detected people, you can identify if a person is wearing a uniform.
I`m software developer and trying to upload some videos using Google Photo API, but I want to set up "High quality mode" of such videos programmatically.
So, the point is that my app must upload videos without user participation every 15 GiB.
Could anybody advice to me - how to do it?
Thanks for help!
All media items uploaded to Google Photos using the API are stored in full resolution at original quality. They count toward the user’s storage.
check here: https://developers.google.com/photos/library/guides/api-limits-quotas
On the bottom of this Amazon Cloud Drive TOS : https://www.amazon.com/gp/help/customer/display.html?nodeId=201376540
6.6 Notice to Illinois Residents. The image recognition features of the Services are not initially enabled for residents of the State of
Illinois. If you are an Illinois resident and wish to use the image
recognition features of the Services for the photos stored in your
account, you are required to read and agree to the important legal
information regarding your use of such features here.
I wonder what is the problem of this particular state? Note that "here" leads to a broken link.
Apparently there is an issue around whether facial recognition is legal or not. See the article below:
http://www.forbes.com/sites/legalnewsline/2016/07/05/il-facial-recognition-law-leads-to-wave-of-class-actions-against-facebook-others/#52ac17294e56
I'm developing an app that will pull in static PNG visualizations of sound clips (30 seconds max). The images will then act as the background image of the player / scrubber in the UI.
I'm looking for APIs / tools that would support the processing and visualization of sound clips on the back-end, generating and saving a quality PNG. I thought Processing might be an option, but am not yet sure if it has these specific capabilities (it's also not really designed to be server-side). Any and all suggestions would be great.
Related - if anyone is an expert in this, and can give me insight into the type of data that can be extracted and visualized from sound, that would also be great. Though, I am hoping by identifying possible tools or APIs, that information will become more clear.
Thank you.
Claudia
I'm doing a language web site for my university language center, where students login and see videos to learn English. i have to do it like this,
person is logging in to the system, search using a search area and find the details,lessons and videos relevant to that videos. this functionality exactly matches the youtube scenario.
for implementing twitter like functionality we can use status-net, is there a similer library, statusnet like famous implementation for youtube or a some kind of platform or a framework like codeigniter that we can use to implement youtube like site very easily??
please suggest some options?? a open source one or a commercial one ???
and what is the best video format to use in a such web site?? flv?? mp4?? or mov???
regards,
Rangana
Your best option is to use a 'cloud' based video processing service. Most have a sample project / library for many different languages and frameworks. Here is a list of a few I've tried and liked:
http://zencoder.com/
http://transloadit.com/
http://pandastream.com/
The typical steps involve uploading the video files to a large 'cloud' static asset host (such as S3) through the browser. If you are inexperienced it is best to select a processor that provides an uploader (it will handle putting the files in the right spot). Of the three, Transloadit and Panda both have custom unloaders.
Usually the service will allow you to either pass the encoding settings (what formats and qualities to) output to as parameters or configure them in your account. To support all current HTML5 browsers you just need H264 (.m4a) and OGG (.ogv). However, the new trend in the video world is for WEBM (.webm) so you might want to include it as well.
Next you will receive a unique code from the web service that you must store in persistent storage (database). The web service can be configured to 'callback' (perform an HTTP POST or GET request to your service) once the video is encoded.
Once your recieve a callback you can activate your video and start dislpaying it on your pages. For displaying, if you are inexperienced I'd highly recommend you use one of the following players:
http://sublimevideo.net/
http://longtailvideo.com/
http://videojs.com/
They all do similar things for different prices. My current personal favourite is Sublime Video (it offers cool light box effects and a gorgeous player).
Why do you have to re-implement Youtube when you can just use it for hosting your videos for free? Many online e-learning portals (e.g. Khan academy) do exactly that.
As far as the best video format to use -- go read about H.264/AVC. It's what Youtube currently uses.
I think you will not find already built solution ;)
But it's not really that hard. You can use existing frameworks that will make your life easier while you build account management system, the rest shouldn't be really that hard (assuming you don't really want to re-build the whole Youtube ;D ).
For playing videos, you can use JW Player. A great piece of software, you should check it out.