Seit den Anf ängen des World Wide Web gewinnt die Informationsextraktion (IE) auf semistrukturierten und natürlich-sprachlichen Texte immer mehr an Bedeutung. Auch die vorliegende Masterarbeit widmet
sich diesem umfangreichen Gebiet und konzipiert und entwickelt ein System zur Extraktion von Informationen aus Internet-Quellen. Dabei stehen Informationen im Vordergrund, die im Projekt
“track-u” der track-u mobility services GmbH, einer mobilen Anwendung zugute kommen sollen.
Das System basiert dabei auf der Nutzung eines Multiagentensystems (MAS), welches die Aufgabe der Quellen-Überwachung übernimmt und mit Hilfe der FIPA-kompatiblen JADE-Agentenplattform umgesetzt wird.
Zusätzlich widmet sich die Arbeit neben der Analyse bestehender IE-Techniken und der Entwicklung eines eigenen Wrappers auf Basis des MAS auch dem Entwurf und der Umsetzung einer Webapplikation, die zur Verwaltung des Systems dient. Dabei kommen Techniken wie das MVC Webapplikations-Framework Ruby on Rails und XML-RPC zum Einsatz.
The Big Word Project. Nice idea to finance the studies of the two guys who created this project, but useful?
It reminds me of projects like themilliondollarhomepage or this guy (i have forgotten the name) who had printed numbers on canvases and sold them for a price which was calculated using a primitive formula.
Mock-ups are also used in the Consumer goods industry, as part of the product development process, when the size, impression and/or artworks have to be tested and approved.
Mock-up is also a frequently used term when talking about an early layout or sketch of a Web site or program. (Wikipedia)
SeatBelt is a Firefox plug-in that assists you when signing in to OpenID sites with your PIP URL. Typically, if you are not signed into your PIP account when you access a sign in page using OpenID, you need to access your PIP account and sign in. Since you must do this within the same browser window, you have to navigate away from the page you wish to sign in to.
Like i’ve said in my last posting i have played with the Ruby AWS:S3 library a little bit. The following happened:
After installing the library with…
sudo gem install aws-s3
…i uploaded the file ‘test.pdf’ using the following script:
# requiring necessary gems
require 'rubygems'
require 'aws/s3'
# some name definitions
file_to_upload = 'test.pdf'
name_of_bucket= '#NAME OF YOUR BUCKET#'
name_of_file_in_bucket = 'uploaded.pdf'
# Create a connection to S3
AWS::S3::Base.establish_connection!(
:access_key_id => '#YOUR ACCESS KEY#',
:secret_access_key => '#YOUR SECRET ACCESS KEY#'
)
# Create a bucket. Only necessary if there is not already one.
AWS::S3::Bucket.create(name_of_bucket)
# Store the file...
AWS::S3::S3Object.store(
name_of_file_in_bucket,
File.open(file_to_upload),
name_of_bucket,
:content_type => 'application/pdf'
)
That was all. After these steps i had a file named “uploaded.pdf” in my bucket. Seems to be a really easy to use this library. :)
It is necessary to redirect the output of the child process (here: /dev/null). Also if the parent process dies, the child process will run independently.
I got an invitation for “ZenZuu“, which is another social-network-bla-bla:
ZenZuu is a next generation social networking platform that lets you create a profile similar to other ancient social networks, except our leading edge technology allows you to make money by signing up your friends and receive revenue sharing.
I don’t know. But for me the focus lies too much on money. Not making friends, not having fun etc. — just making money. And that makes the network not very friendly. You see ads everywhere, have a complex privacy policy and i don’t really find the information who stands behind this project. But thats only my opinion after some minutes of use. ;)
Just visited Twitter to see how they implemented their Captcha functionality and then i saw that they use reCAPTCHA. I’ve never heard of this before, but it seems to be really cool and easy to use:
reCAPTCHA helps prevent automated abuse of your site (such as comment spam or bogus registrations) by using a CAPTCHA to ensure that only humans perform certain actions.
[...]
reCAPTCHA is a Web service. [...] For many applications and programming languages such as WordPress and PHP we also have easy-to-install plugins available.
Schöner Vortrag von Prof. Viktor Mayer-Schauberger über das Vergessen im digitalen Zeitalter und Informationsökologie im allgemeinen. Aufgenommen wurde das Video auf der re:publica’08. Themen sind u.a. Verfallsdatum von digitalen Daten, der sorglose Umgang mit privaten Daten, Zukunft der sozialen Netze und vieles mehr. Sehr sehenswert.
Image from WikipediaJust found Zemanta on the Web and i am just using it when writing this article. Its a Firefox Plugin and searches your written words in WordPress for example and suggests pictures and other articles which you can link to your posting.
Very nice if you are lazy and want to tune up your article with some nice stuff — like i do now ;). And it has a really cool integration in WordPress.
Despite of this, the first thing that come to my mind when i read about this tool, was the copyright problem with embedded pictures. But they solved it apparently:
[...] All content that we are suggesting is copyright cleared – either clearly licenced as Creative Commons, or approved by stock providers.