Azure Scanned PDF to Searchable PDF - azure

I am looking for a way using Azure services to create a text searchable PDF from an image based (scanned) PDF. I have looked at the Computer Vision Read and OCR APIs from Cognitive Services but they both return JSON with the bounding boxes of the text and what I want is a PDF that is image with hidden text.
I am specifically looking for an Azure service to do this for me. I know there are other services to do this (AWS Textract, Abbyy, etc.) and I know I could write code to do it, but neither of those options are what my client is looking for.
Thank you!

Related

Is there an Azure platform service that can convert text from pdf files and save those unstructured data in database?

Our organization is migrating our routine work onto Azure Cloud platform. One of my works is using Python to read many pdf files and convert all the text/unstructured data into tables, e.g.
first column shows the file name and second column saves all the text data etc.
Just wondering is there a service in Azure platform that can achieve this automatically? I am new user to Azure, so not quite familiar with this. Thanks heaps if any help.
I would recommend looking at Azure Form Recognizer. You can train it to recognize tables and extract data from PDF files.

Where can I find Azure TCO calculator file format sample?

I am trying to use Azure Total cost of ownership (TCO) calculator and could not find a link to documentation that provide details regarding what file formats are supported for bulk upload. Neither I could find any recommended sample Excel file format that I can readily use. Ideally these documents or sample file format should be on the Azure TCO calculator page itself. Any pointers in this regards will be greately appreciated.
I found the bulk upload tool template hidden in the wizard. It would have been nice to also provide link to this template on the main page of Azure TCO calculator. You have to click on Bulk Upload tool and step 1 provide the template for uploading your inventory/discovery data.

Is it possible to fill in PDF forms through Google Script? Or add text on PDF through Google Script on the cloud?

Now basically all our jobs are deployed on Google Suites and triggered through Google Script, so what we want is that we can add this function - filling in the PDF forms - using Google Script as well.
Before what we are trying to do was to design a PDF-like template using Google Sheet and by looping, updating the blank information of the form, and finally saving the result as PDF, we will have the result that we want.
However in this time, the PDF is not that easy to imitate in Google Sheet. It is from the government and thus it is better to keep their format and simply fill in the PDF form.
Actually, we've tried to use the Excel VBA or Python on the hard drive to do and it is pretty simple task to finish. But the problem would be wasting time to switch between platforms and it is not a good solution to deploy everything on the Cloud.
So we are wondering if there is a way to 1) filling PDF forms through Google Script? 2) Or add text on PDF through Google Script on the cloud? 3) Or do it on the cloud in general?
Google Apps Script does not have a native way to programatically fill in an existing PDF.
You'll have to find some 3rd party service/API, or a JavaScript library, that lets you programatically fill in an existing PDF and then you can call it or use it in GAS.

Data extraction using Azure Cognitive services

Goodday!
I've got a library full of documents. I would like to be able to know some more about these documents without reading through all of them. On the first page of each document I already put some relevant information about the document such as year of creation, author, number of pages and confidentiality.
I want to use some AI to browse through these documents and decide which is what and return it to me so I can store it in a CSV or something. This way I'll be able to easily find the right document.
Now, I did use LUIS and some of the TAA service of Azure's cognitive services before. But I cannot figure out a way how to accomplish this.
Is it theoretically possible to send the first page (using a powershell script to cut the rest of the document and send it through) to LUIS, which can then return the entities? I'd like them in a way: Date of creation: 18/10/2018, pages: 100.
Or is there a better way to use Azure's powerfull AI to get this done?

how to display an image from a blob in a pdf in cognos

We recently adopted Cognos for reporting and it has proven to work well in many situations. But now I am facing a problem that I consider weird, as most tools used for reporting support that.
I have a table in the database where I store images as blobs. I wanted to display these images in my Cognos report to find out that it is not supported out of the box.
If the report format is in HTML, then the solution is to put an image control on the report and construct the URL to point to some web application that you made that accesses the database and returns the image when that URL is called (there may be other solutions but this is "A" solution I know of).
But in my case I use the PDF report format, and I searched all over the internet and I couldn't find anything that talks about it.
So my question is, is there anyway to display blob images in Cognos PDF reports or is it not supported at all and there is nothing I can do about it?
Does any cognos version support this or do I have no hope of having something like that?
P.S. I am using cognos 8.4.1.
I think you are out of luck since Cognos needs to read the images from disk when generating the PDF.
Here's some alternate solutions:
Try to extract the images you need from the database and write em to disk and then you can use Cognos to read the images from disk with tag or image tools from Cognos.
OR
Build a web service that act as an image but reads the data from database (proxy) so when the image is called in tag, the webservice returns the bytes from database. It could work though I never tried it. Worth a shot imo ;)
You can fin some example here :
http://www.codeproject.com/Questions/466769/save-and-retrieve-some-online-image-in-database-us
http://forums.aspfree.com/code-bank-54/create-webservice-retrieve-image-database-23738.html

Resources