I found that Apache POI doesn't support reading SmartArt components.
(Am I right?)
So I tried Aspose.Slides and it worked.
But the trial version has a critical restriction.
(It can't return text longer than 5 characters.)
Can anybody help?
You can read SmartArt with docx4j/pptx4j.
To see how it is represented, upload your sample pptx to webapp.docx4java.org/OnlineDemo/PartsList.html then click through into the /ppt/diagrams parts.
There is also a document Creating SmartArt with docx4j, though please note this is an advanced topic (and you are largely on your own).
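To make the /ppt/diagrams layout concrete, here is a minimal sketch in Python (not docx4j) that pulls the text runs out of the SmartArt data parts of a .pptx. It assumes the data parts are named ppt/diagrams/data*.xml, as they usually are, and that the text sits in DrawingML <a:t> elements; the function name smartart_texts is made up for illustration.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# DrawingML main namespace; SmartArt text runs are stored as <a:t> elements.
A_NS = "http://schemas.openxmlformats.org/drawingml/2006/main"


def smartart_texts(pptx_file):
    """Return {part name: [text, ...]} for each ppt/diagrams/data*.xml part.

    pptx_file may be a path or a binary file-like object.
    """
    result = {}
    with zipfile.ZipFile(pptx_file) as pkg:
        for name in sorted(pkg.namelist()):
            # Assumption: diagram data parts follow the usual naming scheme.
            if name.startswith("ppt/diagrams/data") and name.endswith(".xml"):
                root = ET.fromstring(pkg.read(name))
                result[name] = [t.text for t in root.iter(f"{{{A_NS}}}t") if t.text]
    return result
```

Calling smartart_texts("sample.pptx") returns one list of strings per diagram data part. It does not tell you which slide a diagram belongs to; for that you would need to follow the slide relationships, which docx4j/pptx4j handles for you.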
@Shinbop,
I have reviewed your requirements and suggest trying the latest Aspose.Slides for .NET 18.11 with a valid Aspose.Slides license. Without a license, the API imposes certain restrictions, including limited text extraction from shapes inside a presentation. Please see the official documentation section on SmartArt shapes.
I work as a Support Developer/Evangelist at Aspose.
According to Microsoft ("Image Analysis" in https://techcommunity.microsoft.com/t5/Microsoft-SharePoint-Blog/Enrich-your-SharePoint-Content-with-Intelligence-and-Automation/ba-p/194174, from May 21, 2018), we should be able to search for text within images.
Is this working for you/anyone? If so, I would like to know what you had to do to get it to work.
I have a SharePoint modern team site with PNG images that contain clearly readable text...but search will not find anything. I have requested re-indexing.
I have had a Microsoft Support request (#10638094) open since June 27 with this question/issue, and no one--even after escalation--has been able to answer it.
Based on the article above, it appears that "MediaService" column(s) should be added to the library to support this; however, I can find no such columns in the environment (using PnP export to review).
Naomi Moneypenny and Kathrine Hammervold highlighted this functionality at Ignite 2017 (https://channel9.msdn.com/Events/Ignite/Microsoft-Ignite-Orlando-2017/BRK2181, about 27:00), but it doesn't seem to be available/working (at least not for me).
August 24: After further research and digging, I have an escalated support ticket at Microsoft (#10638094, unsolved), and there are conversations at https://techcommunity.microsoft.com/t5/Intelligent-Search-Discovery/Search-for-words-in-your-images-in-Office-365/ba-p/135703, https://techcommunity.microsoft.com/t5/Microsoft-SharePoint-Blog/Enrich-your-SharePoint-Content-with-Intelligence-and-Automation/bc-p/236625, and "Does Office 365 image search work? If so, how?". I have yet to hear of this functionality working for anyone. I will keep digging, and I will certainly post if I hear anything.
After some digging, official sources suggest this was already released at the end of 2017. However, there is no related documentation or official guide for this text-in-image search function.
There are two ways I can think of to perform text-in-image search:
Perform OCR yourself on the image before uploading it, and embed the text in the image metadata.
Use a supported image type (IIRC, TIF is one) whose text is recognized.
In your case, you can upload the image and add a column to the list/library that holds the image's text as metadata.
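As a rough illustration of the "embed the text in image metadata" idea, the sketch below writes already-recognized text into a PNG tEXt chunk using only the Python standard library. The OCR step itself is not shown, since it needs a third-party engine. Whether SharePoint search actually indexes a PNG tEXt chunk is not confirmed anywhere in this thread, so treat that as an untested assumption; the function names are made up for illustration.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def make_chunk(chunk_type, data):
    """Build one PNG chunk: length, type, data, CRC-32 over type + data."""
    crc = zlib.crc32(chunk_type + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + chunk_type + data + struct.pack(">I", crc)


def embed_text(png_bytes, keyword, text):
    """Return a copy of the PNG with a tEXt chunk inserted right after IHDR."""
    if not png_bytes.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    # IHDR is always the first chunk; its data length is stored at offset 8.
    (ihdr_len,) = struct.unpack(">I", png_bytes[8:12])
    ihdr_end = 8 + 12 + ihdr_len  # signature + (length, type, CRC) + data
    # tEXt payload: Latin-1 keyword, a NUL separator, then Latin-1 text.
    payload = keyword.encode("latin-1") + b"\x00" + text.encode("latin-1")
    return png_bytes[:ihdr_end] + make_chunk(b"tEXt", payload) + png_bytes[ihdr_end:]


def read_text_chunks(png_bytes):
    """Return {keyword: text} for every tEXt chunk in the PNG."""
    found, pos = {}, len(PNG_SIGNATURE)
    while pos < len(png_bytes):
        (length,) = struct.unpack(">I", png_bytes[pos:pos + 4])
        chunk_type = png_bytes[pos + 4:pos + 8]
        data = png_bytes[pos + 8:pos + 8 + length]
        if chunk_type == b"tEXt":
            key, _, value = data.partition(b"\x00")
            found[key.decode("latin-1")] = value.decode("latin-1")
        pos += 12 + length  # length field + type + CRC, plus the data itself
    return found
```

A safer route, given the uncertainty about what the crawler reads, is still the one described above: store the recognized text in a regular list/library column.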
OneDrive, on the other hand, also has this function. For example, search for things like "cat" and it *should* pull up most pictures you have of cats. It is more likely using tags as labels for the image than reading the picture itself.
Also, I believe OneNote indexes recognizable text and handwriting. Maybe this can point you in the right direction.
Microsoft Azure's Computer Vision offers a service to recognize text in images. Maybe this can help.
"Is this working for you/anyone?" Yes, I responded to this post elsewhere and see it posted here, as well. Unfortunately, I cannot tell you HOW to get it to work or to verify that it is correctly configured. I can only suggest a test for you to see if it is working for you, as it works for me. I have not tested every way in which it could or should work. I have only discovered it working with PNGs I inserted into Wiki Pages in SharePoint Online. Those PNGs are generated using Snag-It to take Screen Captures and I do not see where Snag-It would be doing any OCR on the image to embed anything, etc. OCR is not even in the Snag-It help file, so I believe the PNG files are just simple PNGs. I insert them into the SharePoint Wiki page, which uploads them to the Site Assets library. And, when I search for a word in the image, the image is returned as a result - not the Wiki page. So, suggest you try a simple test of just inserting a PNG with text in it into a Wiki Page and give the index a bit of time to run to see if it works for you.
It seems like the functionality has matured recently. I have been testing it more thoroughly, and I have documented the results in my blog at http://www.collaboration-foundry.com/SharePointImageAnalysis.
Bottom line: It works for me in OneDrive and SharePoint (modern and classic), but I've only seen it work on the out-of-the-box Document content type--which limits custom solutions somewhat.
It's cool functionality when it works. Looking forward to seeing Microsoft build on this.
John
I am developing a Word add-in with Office.js that utilizes ContentControl.insertText. According to the documentation, this method is provided by WordApi requirement set 1.1, which is supported in Office Online. Furthermore, the documentation for ContentControls lists Office Online as being supported and does not mention any exceptions when it comes to using that particular method.
That said, when I run the add-in in native Word 2016, everything works fine, but when I run it in the Word Online environment, it fails with the following exception:
code:"NotAllowed"
errorLocation:"ContentControl.insertText"
message:"The action isn’t supported in Word Online. Check the OfficeExtension.Error.debugInfo for more information."
For reference, I am using it like so:
control.insertText(heading.Text, 'Replace');
I also tried running the example provided in the documentation, with the same results: it works in Word 2016, but fails with the above exception in Word Online.
Has anyone been able to successfully insert text into a ContentControl in the Office Online environment? If not, are there plans for this functionality to be supported?
Any input or clarification would be most appreciated! Thanks!
I'm an engineer from MS. There are some cases in which ContentControl.insertText is not allowed.
I think the cause in your case is that heading.Text contains more than one paragraph and you are trying to insert it into an inline content control.
An inline content control is one that covers only part of a paragraph.
To confirm the root cause in your scenario, could you share the document and the script code?
Office JS API for OneNote... Love it, but I am missing some critical things. Can someone comment?
I got a paragraph, type is RichText. But, I could not find in the API the style of the rich text. In my case, I want to know if it is a Header 1,2,3... or Quote, etc.
Same-page linking: In OneNote desktop I can right click any text and copy link to that specific paragraph. Clicking that link later will take me directly to that paragraph. However, I did not find an API that can navigate directly to a paragraph, the only one I could find navigates to a page: navigateToPage(page: Page) and navigateToPageWithClientUrl(url: string)
Is that even possible? Also, I noticed these links don't work at all in the web version of OneNote, but that's a different story I guess.
I am building a (free!) TOC add-in that you can put at the top of your page and that will show all headers, each with a link to that header in the page. However, the lack of the above capabilities makes it impossible for such a simple add-in to work (or, at least, I thought it was a very basic and simple one...)
Any help will be greatly appreciated!!! Like I said, if I get these 2 issues resolved - the add-in will be available for free.
https://dev.office.com/reference/add-ins/onenote/paragraph?product=onenote
Sounds like a cool add-in!
You can use the getHtml method on richText to get the style. There is an example in this answer.
OneNote Add in: Getting HTML content
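Once you have the HTML string, finding the style comes down to looking at the element tags. The sketch below shows that parsing logic in Python; in the add-in itself you would do the equivalent in JavaScript. It assumes headings come back as <h1> to <h6> and quotes as <blockquote> or <cite>, which this thread does not confirm, so check the actual getHtml output first. The names StyleCollector and find_styles are made up for illustration.

```python
from html.parser import HTMLParser

# Assumption: headings come back as <h1>..<h6> and quotes as <blockquote>/<cite>.
STYLE_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6", "blockquote", "cite"}


class StyleCollector(HTMLParser):
    """Collect (tag, text) pairs for style-bearing elements in an HTML string."""

    def __init__(self):
        super().__init__()
        self.items = []      # finished (tag, text) pairs
        self._open = None    # style tag currently being read, if any
        self._parts = []     # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        # Only track the outermost style tag; nested ones are treated as text.
        if tag in STYLE_TAGS and self._open is None:
            self._open, self._parts = tag, []

    def handle_data(self, data):
        if self._open is not None:
            self._parts.append(data)

    def handle_endtag(self, tag):
        if tag == self._open:
            self.items.append((tag, "".join(self._parts).strip()))
            self._open = None


def find_styles(html):
    """Return a list of (tag, text) pairs in document order."""
    collector = StyleCollector()
    collector.feed(html)
    collector.close()
    return collector.items
```

For a TOC, the (tag, text) pairs give you the heading level and the label; the link target is the part the API cannot provide yet.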
As for creating links to a specific paragraph, OneNote add ins do not expose the capability of doing that. You can add a request in our uservoice. The only supported capability is to navigate to a page.
https://onenote.uservoice.com/forums/245490-onenote-developer-apis
As for links that work in OneNote online, the "webUrl" property in a page will contain a link that works in OneNote online.
https://github.com/OfficeDev/office-js-docs/blob/master/reference/onenote/page.md
Thanks for the feedback. We will update the documentation.
There is currently no way to scroll to any region in the page.
Has anyone ever run across the pain.008.002.02 format? It is an ISO standard format used in Germany for direct debit transactions. The problem I have is that although I found documentation that talks about it, I did not actually find the XML Schema (pain.008.002.02.xsd) for it, which would be very useful for generating Java bindings. Otherwise I will need to do this manually, which is simply hell. Does anyone know where I could find it? It's not on the ISO page with all the other formats. If the file has to be bought (I ran into places that suggest this), that information would also be very useful.
After hours and hours of searching, I found it in the ebics archive:
http://www.ebics.de/index.php?id=77 at the bottom of the page "Anlage3_Archiv_V2_5.zip"
Ebics is the official "Electronic Banking Internet Communication Standard" from the Deutsche Kreditwirtschaft.
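Once the archive is downloaded, a small sketch like the following can locate the schema inside it. The internal folder layout of Anlage3_Archiv_V2_5.zip is not described here, so the code searches by base file name rather than assuming a path; find_member and extract_member are names made up for illustration.

```python
import io
import zipfile


def find_member(archive, filename):
    """Return archive member paths whose base name equals filename (case-insensitive).

    archive may be a path or a binary file-like object.
    """
    with zipfile.ZipFile(archive) as zf:
        return [
            name for name in zf.namelist()
            if name.rsplit("/", 1)[-1].lower() == filename.lower()
        ]


def extract_member(archive, member):
    """Return the raw bytes of one archive member."""
    with zipfile.ZipFile(archive) as zf:
        return zf.read(member)
```

For example, find_member("Anlage3_Archiv_V2_5.zip", "pain.008.002.02.xsd") lists every matching path, and extract_member returns the schema bytes that you can then save and feed to your binding generator.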
I would download the SEPA Explorer, install it and then look in the install folder (SEPAExplorerV2\XSDFiles) for what you need (pain.008.002.02.xsd is in there).
Already generated Java bindings for the XSD you're looking for are in the Open Banking Tools (LGPL) here.
I'm working on implementing a blogging engine and want to make it compatible with Windows Live Writer. I found an excellent post on this, but what I can't find is documentation on the placeholder tags used in the weblayout and webpreview templates. The only page I can find on MSDN referring to these files is this one, which doesn't mention the placeholder tags.
Can anybody point me in the right direction?
Many thanks
Tony
The placeholders are:
{post-title}
{post-body}
The MSDN article that you linked to does state that, though.
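If you want to check locally how a template looks once the two placeholders are filled in, a minimal Python sketch is below. It only illustrates the substitution idea; how Windows Live Writer performs it internally is not documented in this thread, and render_template is a made-up name.

```python
import re

# Matches exactly the two placeholders named above.
PLACEHOLDER = re.compile(r"\{post-(title|body)\}")


def render_template(template, title, body):
    """Replace {post-title} and {post-body} in a single pass.

    A single regex pass leaves other braces (for example CSS rules) alone
    and never re-scans text that was just substituted in.
    """
    values = {"title": title, "body": body}
    return PLACEHOLDER.sub(lambda match: values[match.group(1)], template)
```

A template such as `<h1>{post-title}</h1><div>{post-body}</div>` can then be rendered with sample content to preview your layout.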