• Happy Coding

Archive for the ‘Search&Find’ Category

Apache Lucene and Ruby

Robert Dempsey has written a nice article about ‘Using Amazon S3, EC2, SQS, Lucene, and Ruby for Web Spidering‘, which is very interesting because it describes how to interact with Apache Lucene using Ruby.

Reblog this post [with Zemanta]

next-generation architecture for Google’s web search

For the last several months, a large team of Googlers has been working on a secret project: a next-generation architecture for Google’s web search. It’s the first step in a process that will let us push the envelope on size, indexing speed, accuracy, comprehensiveness and other dimensions. The new infrastructure sits “under the hood” of Google’s search engine, which means that most users won’t notice a difference in search results. But web developers and power searchers might notice a few differences, so we’re opening up a web developer preview to collect feedback.
(see Google Webmaster Central)

Reblog this post [with Zemanta]

Clusty. My new favourite search engine.

It’s a shame that i didn’t recognize this thing of beauty before. :)

Clusty

searchCrystal. search visualization tool

searchCrystal lets you search and compare multiple engines in one place.
It is a search visualization tool that enables you to compare, remix and share results from the best web, image, video, blog, tagging, news engines, Flickr images or RSS
feeds.

Read the rest of this entry »

krugle code search engine

Just found krugle. A nifty search engine which helps you finding source-code by searching through projects which are available in the internet. It also allows you to add comments to the code.

Krugle helps programmers find existing source code and the information they need to evaluate, deploy and fix code.

There is a demo-video which shows all the features.

I found my code at Krugle

Grokker. Now and then.

I have tested Grooker — a really nice looking search-engine — some years ago, when it was a desktop-application. Seems that it has become a web-application since then. Its still nice looking and returns good results.grokker webapplication

Search Wikia

Thanks to Dave Winer’s posting, i have found out about Search Wikia.

The basic concept of the search project is that I want to create a completely transparent, open-source, freely licensed search engine, [...]
(more @ infoworld.com)

DBpedia II.

Just found another posting about DBpedia in a blog of one of the developers. It tells you a little bit more about the backgrounds of this great project.

Taking Wikipedia to the next level

Just read something about the following project:

DBpedia.org is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated queries against Wikipedia and to link other datasets on the Web to Wikipedia data.

Instead of searching the full-text of Wikipedia, it allows you to execute SPARQL-Queries on the data. With “the data” i mean RDF-Data. Because the goal of the DBpedia-project is to create a whole bunch of structured data out of the Wikipedia-texts.

A little search interface can be found at http://dbpedia.org/search/.

Artikel bezüglich Google und Datenschutz

Eben entdeckt und recht interessant geschrieben. Hier zu lesen.

Grokker2 als Beta-Version fuer Mac

Hier zum Runterladen.

Javadoc Dokumente finden, ohne Google zu nutzen

Wer kennt das nicht: Man ist am Programmieren und sucht die Javadoc Dokumentation zu einer Klasse oder Methode. Ich habe es bis jetzt immer so gehandhabt, dass ich mittels Google nach der Doku gesucht habe. Durch einige Weblog-Beitraege bin ich aber jetzt auf http://javadocs.org/ gestossen. Einfach Klassen- oder Packagename eingeben und ……… fertig!

Via the excellent Russell Beattie, a pointer to javadocs.org. Given the amount of Java programming I am apt to do in the next few years, I suspect this could save me several weeks out of my life, in aggregate. [...]

JavaDocs.org rules! It is quickly become as frequently used while I program as Google. [...]

grokker: Visualisierung von Suchergebnissen

grokker ist ein Programm zur Visualisierung von Suchergebnissen. Dabei greift grokker auf verschiedene Suchmaschinen (z.B. Google) zurueck. Die Ergebnisse werden danach in lustigen “Blasen” gruppiert:

grokker blasen

Testen konnte ich das Programm leider nicht, da keine Demo-Version fuer Mac verfuegbar ist — fuer Windows allerdings schon. Also wer Lust hat, findet hier die Demo.

Vivisimo — Hierarchische Kategorisierung von Suchergebnissen

Vivisimo ist eine Meta-Suchmaschine, die die Ergebnisse hierarchisch in Kategorien-Ordnern sammelt und somit ein wenig mehr Uebersicht in der Ergebnisliste schafft. Dabei nutzt sie alle bekannten Suchmaschinen != Google.
Hier mal ein Link zum Testen: RDF.