I am using Textrank method for extracting keywords from text and I am able to print individual keywords along with their scores. But I am currently trying to output whole text with the keywords I extracted earlier be highlighted (encircled etc).
I'm not sure who your target audience is, but I think the simplest solution might be to programmatically generate hypertext (HTTP), for example, where the keywords are given a foreground/background color of your choice. In fact, this can see this as being quite useful.
SpaCy has visualization tools but I believe they are targetted at providing specific NLP visualizations. I don't think they offer what you want, which seems to be a canvas for present information.
Oh! If you want to hack a solution, you can try this:
Create a custom entity type in SpaCy and have SpaCy report your keywords as your new custom entity type. Then you can use the SpaCy Entity Visualizer to highlight your entities.
Related
Does someone have a recommendation of tagging tool for NER types in raw text?
The input for the tool should be a library of text files(.txt simple format) , there should be a convenient UI for selecting words and set the tag/annotation fit to selection, the output should be structural representations of the tags(e.gs tart index , last index, tag in a JSON format)
Founderof LightTag here
We provide a super convenient interface to do span annotations such as named entity recognition, classifications and relationships.
You can work as one labeler or bring in a team and LightTag will disribute work between everyone automatically (no more selecting files and remembering what you labeled already) .
You can upload your own suggestions and let labelers use those, or use LightTags built in model.
Of course you can annotate at the character level and highlight subwords or multi word phrases.
You can try https://github.com/lasigeBioTM/MER (bash)
see the demo at http://labs.fc.ul.pt/mer/
Online tools:
I guess Dataturks' POS tool should work fine for your use case, you can just upload your data and specify the labels. The UI seems convenient enough.
Here is the link:
https://dataturks.com
It's an online tool, so you can work with multiple people to get the tagging done.
The exact output format you are looking for is not supported, but the format can easily be converted to what you are looking for, the output is like: word___LABEL word2___LABEL , so a simple 2-line script can convert it to start and end index.
Offline:
Another tool you can check out is prodigy, it's a downloadable software and does similar things. Just that you might be willing to pay for it upfront.
https://prodi.gy
I'm using Freeling to analyse text in Spanish, but I have a question when it comes to customize the used dictionary. The specific example is that the word
morelos
is a singular masculine noun but is being split in two words and classified as follows:
more morar VMM03S0 1 -
los lo PP3MPA0 1 -
I've tried a wide variety of things from adding the word in the dictionary, which entry the following
morelos morelos NPMSS00
I've tried not using multiwords, but is also unsuccessful.
Can anyone recommend me what to do?
(Is there anywhere a comprehensive tutorial to understand use freeling?)
This is because the affixation module is considering this a clitic pronoun (morar+los)
You can deactivated affixation, or try to fine-tune the affixation rules.
There is comprehensive information about FreeLing in its user manual and in its user forums. Check FreeLing webpage
I'm trying to figure out how I can use Synonyms to only search against specific sites in the Annotations via the label - but can't find any information on how to do this.
Can you have Synonyms setup / linked to only search specific sites via labels? or how can you call the API to use a specific Synonym and pass which label to use and thus only search those sites that correspond to the label?
Any help / advice would be very much appreciated
Just to add we are using the Google Search API to fire off the searches and then return results programmatically
Managed to figure out how this should be done
Setup refinements and associated labels
Associate these to your annotations
Setup your Synonyms
You can now call your Synonyms and use the more: option to select the label you want use
I am working on a project to digitize approximately 1 million images for which metadata will be added to facilitate search.
Each image is, for example, a page in a dictionary. But not text. Just a static scanned image. OCR is not an option :(
My objective is to emulate the current search procedure which consists of looking up the alphabetical entries till the correct page is found. In absence of machine readable text, I am looking at tagging each page with Dictionary range tag. For Example (Apple-Canada). So if someone searches for "Banana", it should hit the (Apple-Canada) range Tag.
Is this supported in SharePoint out of the box? If not, is there an addon product which provides this functionality or am I looking at building a customized extension?
Any help will be appreciated :)
Installing the IFilter for TIF files is done with a couple of clicks and gives you free OCR along the way. Very good for scanned pages.
On your question though: No, SharePoint does not have any kind of "range" tags or fields. The only vaguely similar thing to what you are requesting is the Thesaurus of the search. There you could define acronyms and synonyms for words and it would actually search for something else. So you could enter Banana but it would actually search for Apple. Some examples here: How to: Customize the Thesaurus in SharePoint Search and Search Server.
Other than that I can only think of a custom implemented search provider giving you the flexibility you need.
I want to create visuals along the lines of CNN's "red-state, blue-state" shadings of the states in the U.S. for my project. I'm planning to do something fancier than just shading the state's shape in a color. Are there open source libraries of state shapes/polygons (or - if not open source - others) that I can import into Word, Excel, etc. that I can use to show complicated graphs based on states?
I have Map Point, but haven't been able to figure out how to shade the states in a complex way.
you could try google charts, it looks like http://www.woot.com is doing something similar to what you need
Here is a good example using google maps... I've used code like that before.. perhaps from this exact example.
http://econym.org.uk/gmap/example_states2.htm
EDIT: you might want to consider converting the states.xml into JSON... it'll be smaller (136k of XML right now!) and should load faster in most browsers.
There might be a couple parts to the question you are asking, but to address the first part "Are there open source libraries of state shapes/polygons...", here's a resource to check out:
http://commons.wikimedia.org/wiki/Category:SVG_maps_of_the_United_States
It's a list of various SVG(scalable vector graphics) files which can be imported into a number of applications. Basically a giant xml representation of lines and endpoints. This can be directly converted to XAML, if you're into a more programmatic way of charting(ie, C# w/ Silverlight).
However, to address the second part regarding MS Office, Visio can import SVG files for manipulation as well. I'm unsure what type of graphs you were looking for, but I hope this can assist in some small way on your path to awesomeness ;)