Open source search engines : alternatives to Lucene [closed] - search

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
Currently we are using Lucene for our search engine, but we want to look at some alternatives. I have looked at several on the net but seems like a lot of them are out of date or the development stopped. That is why I want to ask if you guys know any good open-source alternatives to Lucene that are still in development?
Kind regards,
Merlijn

Try Sphinx search http://sphinxsearch.com/. It is used by many NLP researchers.

If you are looking for an open source and Java based alternative, then you could try Terreir. Note that Terrier targets academia.
If the language is not an issue, then you could look at Xapian. I found its community quite active, and it has participated in Google Summer of Code several times.
Otherwise, you could try Whoosh, a python based search library.

FastcatSearch is also open source and java based alternative.
Lucene is a IR library as already you know, Solr is a search server, and FastcatSearch is a counterpart of Solr.
It provides web-based manager, so that you can set up configs easily.

Related

Solving Homophone Confusion [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 8 years ago.
Improve this question
This may be a question that is not suitable for stackoverflow, but I had no other better place to ask it. I was wondering if there are any known tools (non-commercial) that can be used to solve the homophone confusions such as these in a sentence?
it's vs its
you're vs your
I am new to NLP and I haven't used any of the known tools. Tried to search for these in google but nothing useful shows up. Are there any parts in NLTK or CoreNLP that cover this?
I have no experience with this topic but I found a how to PDF that may be of some use to you.
How to solve homophone problems
It's no complete solution, but LanguageTool has some rules for this. See the rule file and search for rulegroup id="IT_IS"(disclaimer: I'm the maintainer of LanguageTool). After the Deadline also uses a rule-based approach, only that it tries to avoid useless suggestions by filtering its suggestion against a large n-gram database.

UML diagram Display framework [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
I am looking to make a prototype of a UML differencing tool. What open source visualization tool kit exist that allow me to display UML diagrams?
Yet, I still need to be able to make personal edits the display. (Ie. refocusing, and drop down lists, additional displays.)
I have looked at Zest and Perfuse. But neither of these applications have a way to display a UML. If they do work, where are examples of them displaying UMLs?
you may want to have a look at GEF. As far as I know, ArgoUML is based on this framework.
There are several open source UML tools for Eclipse listed here along with their license.
For the diff part, you should check EMFCompare and this list of model versioning tools since they could be reused in your project

What is the best haskell documentation available online? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
With Java, Groovy, and Python, it is easy to find the standard, complete, easy to navigate documentation for the language.
I'm starting to learn Haskell, and I'm not sure where to find this. In particular, it doesn't seem to exist at haskell.org.
If you scroll down on http://haskell.org/, on the left sidebar under "Libraries", you will find a set of documentation of all libraries (called packages), which might be what you're looking for. However, be aware that it is an extremely large database, because it includes user-submitted content as well as standard libraries.
The two online search engines "Hayoo" and "Hoogle" might be able to help you. You can search by function name, type signature, or both. As far as I know, the only difference is that "Hayoo" searches the entire database, whereas "Hoogle" searches the standard libraries plus a few common extras. I would recommend Hoogle, because it generally gives you what you want, and is in my experience more reliable.
While we're on the subject, I personally think that http://learnyouahaskell.com/ is a great resource for learning Haskell if you've never seen a functional language before.
http://www.haskell.org/ghc/docs/6.12-latest/html/
Sorry can't comment yet, so here.
Hoogle is not just online search engine. Check ghci integration.
I was just looking for the same thing:
Wiki: http://www.haskell.org/haskellwiki/Category:Haskell
Good Intro: http://learnyouahaskell.com/chapters
Not much help, but it's a start.
http://book.realworldhaskell.org/read/ is pretty good.
http://learnyouahaskell.com/
Give this a shot!
http://www.haskell.org/ghc/download.html#snapshots
Check out this intro to natuaral lang. processing: http://nlpwp.org/book/

Dreamweaver equivalent for Linux [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
I am looking for an equivalent software to Dreamweaver in Linux.
It is not an exact match but it is based out of Eclipse which means super cross platform funky java love.
http://www.aptana.com/
Aptana Studio is actually what I replaced Dreamweaver with since Adobe bought Macromedia, I use it on Windows and Linux without trouble. But for the suggestion you will also get my 2 cents about Wysiwtf... it is almost never what you get. Some of the best code I have ever done in my life was done in SciTE (also available in Linux), it supports multiple coding languages and offers enough features to be useful without becoming bloated.
If you want something reasonably non-technical, then perhaps Kompozer?
Or, if you want more technical stuff, then you probably want Aptana.
Another mention bluefish.
Depending on what desktop environment you use I can recommend Quanta+ to you. It's part of the KDE SC but can also be used in other DEs.
You could also use KompoZer, it seems to be nice as well. Didn't test this one though.
I've also researched this for myself, and the answer is that, in my opinion, there is nothing comparable.
Most people choose Dreamweaver for its WYSIWYG (as good as it can be with HTML), and the ease of use. If you're looking for database connectivity, PHP debugging and the like, then Elipse beats Dreamweaver by a lot, but chance is the original poster is looking for the ease-of-use, so neither Bluefish nor Eclipse is going to satisfy him.

Could you recommend an unstructured data indexing software? [closed]

Closed. This question does not meet Stack Overflow guidelines. It is not currently accepting answers.
Questions asking us to recommend or find a tool, library or favorite off-site resource are off-topic for Stack Overflow as they tend to attract opinionated answers and spam. Instead, describe the problem and what has been done so far to solve it.
Closed 9 years ago.
Improve this question
I am collecting logs from several custom made applications. Each application has it's own log format. What I'm looking for is a central tool which would allow me to search through all of my logs. This means the tool would have to be able to define a different regex (or alike) for each log file (marking where a record begins, ends, and what are the fields). I've been trying Splunk, but I'm not happy with it, since performance are slow, I'm limited (free version) with the amount of indexed data per-day, and it's not as flexible as I want it to be.
Could you recommend a software (preferably free or cheap) for the task?
You can try Lucene. It is free. It is written in Java, and it allows full-text search over large amount of data. It is not a complete application, but rather a library, so you have to write code that uses it to index and to search your logs. You may have to define different document types or at least different indexing functions for your logs, but then search works beautifully.
If you can use Windows, try out Microsoft's best tool ever, Logparser. I wish there was such a simple tool for Unix. But there isn't. And although I've kept wanting to get around to making a Unix version of Logparser, I just haven't had the time.
Note: This would be a great project for someone with time on their hands or for a grad-student somewhere!
http://www.splunk.com/
Never used it, but have heard of it.

Resources