I have an Excel sheet with fields such as [name][url in folder][keywords] ... I am trying to find the best way to write IPTC metadata keywords to my 60'000 TIFF images in order to be able to search through them (with Adobe Bridge) from this Excel file. I have tried exiftool.exe but "Adobe Bridge" cannot read the rendering keywords. I have seen that it may be possible in PHP, but I would like to know if code or software already exists.
Any IPTC library can do it for you. I use Python so for example http://tilloy.net/dev/pyexiv2/ would be my tool. Look at the tutorial on http://tilloy.net/dev/pyexiv2/tutorial.html
Related
I'm developing a web page. I want to create an option to generate a .pdf file and allow the user to download it.
Currently I'm using jsPDF but I'm finding it very hard to properly format the document.
I was hoping to find a new way of building it in markdown format, compile it and then download it.
Is there a way that I can do this, in node.js, where say, I have a string in memory (which is the markdown text format), compile that into a pdf and then download it from the page?
I haven't found any package that really does this, if you know, feel free to just let me know which one can achieve this and I'll figure it out.
For such a thing, I recommend building your .pdf file first a HTML file, so you could edit it easily (hardcoded or dynamicaly)
then convert your html file to .pdf file.
there is alot of packages to do this
have a closer look on this package
https://www.npmjs.com/package/html-pdf-node
Does anybody knows how to create a structured report using dicom scope toolkit via console (ubuntu 16.04) with a link to a related image?
The thing is that I have an image of some kind of trauma and I have to connect with a report which is in a text file. The last file should be in .dcm format which contains annotation and a link to an image. I have to use dicom scope program.
Maybe others refrain from answering because your question needs a very long answer. I cannot provide step-by-step instructions, a few hints, though.
The way I would go is to:
(assuming that your image is available in DICOM format):
obtain a sample structured report. I think that the "simple" Basic Text SR is what you want to go for. You can find some samples here.
convert the SR to an XML file using dsr2xml
edit the contents in XML. Do not forget to include your image reference in (0040,a730) Content Sequence -> (0008,1199) Referenced SOP Sequence
convert the XML back to DICOM SR using xml2dsr
By the way: From your question, I did not really understand why you want to use a structured report, as you wrote that your report is plain text. Instead of digging into the complex structure of SR, you may want to consider exporting the report to an Encapsulated PDF document which can reference images as well.
I'm new to this forum and to Orange.
I don't really now Python at this point but am ready to learn.
However, before going further in this environment I would like to know if it can answer my needs !
What I am basically doing is "transforming" PDF product catalogues into Excel files that can be used by another software to create a database for another software.
I have tiles catalogues in PDF just like this one :
and turn it into this type of xls table : http://imgur.com/BtLBkOS
I basically need it to retrieve the article number, the colour, the size (e.g: 20x20). The G/B parts are completed manually after it has been done.
All catalogues are not the same so I sorted out some using pdftotext, RegEx with Notepad++
But I would like to know if this data mining solution could work it out ?
Orange does not support reading PDF files. You will have to use specialized utilities or program it yourself.
I am developing a small excel plugin. As part of it I need to convert my excel document to PDF format. How can I do that? All the plugins and printers that I've found do not convert links created by the hyperlink function.
I know this is not programmatically related per say, but I am developing this software, and vba is quite a reasonable solution.
Thank you all very much!
I'm not sure any solution will work using the hyperlink function. The reason is typically the convert to PDF functions like a printer driver and the 'printer' only gets the helper text information not the underlying URL. But, using PrimoPDF, any text formatted like a URL "http://www.stackoverflow.com" for example, will get converted to a clickable URL (in the modern readers).
At the moment, we use MS WORD and MS EXCEL to mail merge documents that needs to be sent to multiple recepients.
For example, say there is a complaint form where the complainant needs to fill in his/her name, address, etc. So we have a .doc file set up with the content and the dynamic entities set up for mail merging, with the name and address details put in an excel file, from where we can happily mail merge to generate all or just the necessary forms/documents.
However, I would like to automate this process, like a form in a website where the complainant can fill in his/her name, address and other details, and we could use that to generate the complaint form automatically and offer it to be downloaded (preferrably as a pdf).
Now, the only solution that comes to mind, is Latex, so that I can just replace the needed entities and just compile to PDF. However, that bit has to be negotiated with the webhost, if they are offering Latex or not.
Is there any other solution? Any other way we could get this done, with something that shouldn't be a problem for most webhosting solutions to offer?
EDIT: I would prefer a non .NET or rather non microsoft solution since, the servers are running linux and while mono might be capable of getting the job done, none of our devs know any .NET languages. However, if required we might have to dwelve into it.
Generating PDF using an XSL. Check the following: Apoc XSL-FO
You will need to create an XML file with the required fields and transform that with this tool.
If you wish to avoid .NET then XSL-FO is worth a look. Try the FOray project.
XSLT can be a steep learn if you do not have experience already. Also users will not be able to change the templates without asking the XSLT guru to do it.
If your templates are already in MS Word and MS Excel then I would stick with generating MS docs on the server. These are now easy to work with from code since OpenXML - check out OfficeOpenXML and OpenXMLDeveloper
Apache FOP : http://xmlgraphics.apache.org/fop/
I suggest generating rtf on the server: it's easy enough to automatically generate using cpan's RTF::Writer, has converters generating good pdf, can be edited by hand in word, oo-writer & TextEdit, doesn't have any really bad compatibility issues between the main editing applications, and has decent text & resource extraction tools, with text extraction being rather better than pdf.
There's some support for moving between rtf & latex, although the best rtf -> latex converter, docx2tex, depends on the System.IO.Packaging .net module, whose mono implementation isn't yet rock solid.
Postscript — Not a recommendation: it's too much of an unwieldy sledgehammer for this job, but iText will generate the pdf directly from the form data. If you wanted to do fancy things like signed pdf, that would be the way to go.
Postscript #2 — If you break up the Word document into individual files using word's master document representation, then you can clobber one of the parts with hand-generated content. This makes it easy to do something approximating form-filling on word .doc files using just standard file-utils and some trivial rtf->doc tweaking.