Apache Lucene and Ruby
Robert Dempsey has written a nice article about ‘Using Amazon S3, EC2, SQS, Lucene, and Ruby for Web Spidering‘, which is very interesting because it describes how to interact with Apache Lucene using Ruby.
Robert Dempsey has written a nice article about ‘Using Amazon S3, EC2, SQS, Lucene, and Ruby for Web Spidering‘, which is very interesting because it describes how to interact with Apache Lucene using Ruby.
For the last several months, a large team of Googlers has been working on a secret project: a next-generation architecture for Google’s web search. It’s the first step in a process that will let us push the envelope on size, indexing speed, accuracy, comprehensiveness and other dimensions. The new infrastructure sits “under the hood” of Google’s search engine, which means that most users won’t notice a difference in search results. But web developers and power searchers might notice a few differences, so we’re opening up a web developer preview to collect feedback.
(see Google Webmaster Central)
It’s a shame that i didn’t recognize this thing of beauty before. :)
searchCrystal lets you search and compare multiple engines in one place.
It is a search visualization tool that enables you to compare, remix and share results from the best web, image, video, blog, tagging, news engines, Flickr images or RSS feeds.
Just found krugle. A nifty search engine which helps you finding source-code by searching through projects which are available in the internet. It also allows you to add comments to the code.
Krugle helps programmers find existing source code and the information they need to evaluate, deploy and fix code.
There is a demo-video which shows all the features.
I have tested Grooker — a really nice looking search-engine — some years ago, when it was a desktop-application. Seems that it has become a web-application since then. Its still nice looking and returns good results.
Thanks to Dave Winer’s posting, i have found out about Search Wikia.
The basic concept of the search project is that I want to create a completely transparent, open-source, freely licensed search engine, [...]
(more @ infoworld.com)
Just found another posting about DBpedia in a blog of one of the developers. It tells you a little bit more about the backgrounds of this great project.
Just read something about the following project:
DBpedia.org is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia and to link other datasets on the Web to Wikipedia data.
Instead of searching the full-text of Wikipedia, it allows you to execute SPARQL-Queries on the data. With “the data” i mean RDF-Data. Because the goal of the DBpedia-project is to create a whole bunch of structured data out of the Wikipedia-texts.
A little search interface can be found at http://dbpedia.org/search/.
Eben entdeckt und recht interessant geschrieben. Hier zu lesen.
Hier zum Runterladen.
Wer kennt das nicht: Man ist am Programmieren und sucht die Javadoc Dokumentation zu einer Klasse oder Methode. Ich habe es bis jetzt immer so gehandhabt, dass ich mittels Google nach der Doku gesucht habe. Durch einige Weblog-Beitraege bin ich aber jetzt auf http://javadocs.org/ gestossen. Einfach Klassen- oder Packagename eingeben und ……… fertig!
Via the excellent Russell Beattie, a pointer to javadocs.org. Given the amount of Java programming I am apt to do in the next few years, I suspect this could save me several weeks out of my life, in aggregate. [...]
JavaDocs.org rules! It is quickly become as frequently used while I program as Google. [...]
grokker ist ein Programm zur Visualisierung von Suchergebnissen. Dabei greift grokker auf verschiedene Suchmaschinen (z.B. Google) zurueck. Die Ergebnisse werden danach in lustigen “Blasen” gruppiert:

Testen konnte ich das Programm leider nicht, da keine Demo-Version fuer Mac verfuegbar ist — fuer Windows allerdings schon. Also wer Lust hat, findet hier die Demo.