btw, gnowsis is not part of KDE – but its a good idea :-)

Due to current rumour that appears on the net and in chats around me because of my talk on wednesday at ISWC: gnowsis is not part of KDE. But nepomuk-kde is part of kde.

its all a bit complex: gnowsis was my diploma thesis and was continued as open source project at DFKI, where I started working in 2004. In 2006, NEPOMUK started as a EU project (hence NEPOMUK was then used as project name for an EU research project, not for software) which funded research on Semantic Desktop. Gnowsis continued a bit until about December 2006, when we reached version 0.9.2, which is the last release.

Since then, most of our energy goes into (“Psew”, nepomuk-server) which is a Java-based Semantic Desktop research prototype, and much work also goes into, which is a KDE based semantic desktop product.

nepomuk-KDE = product
gnowsis = Leo’s hibernating Semantic Desktop open source project
nepomuk-server = java based semantic desktop research prototype (=most many features, but less inteagration with os as KDE has)
psew = GUI for nepomuk-server (at the moment also bundles nepomuk-server)

All of them share the same concepts: RDF on the desktop, a PersonalInformationModel, Annotation of everyday things, embedding into existing applications (=thats where we differ from others like haystack).

but as people keep asking about it: I do long for a working semantic desktop, and porting gnowsis’ simplicity to KDE would indeed be nice.
How nice? Nice enough that you want to pay me something to do it?

Because I would like to continue working on this the next years (NEPOMUK EU project or not) …. the first six years were already quite rewarding ­čÖé

Aperture 1.2.0 out


Aperture is a Java framework for extracting full-text content and
metadata from various information systems (e.g. file systems, web sites,
mail boxes) and the file formats (e.g. documents, images) occurring in
these systems.


Download URL:

After three years of development Aperture is stable enough
to drop the .beta suffix from the release. 1.2.0 leverages
architectural improvements made in 1.1.0.beta to bring
support for compressed archives and to streamline
email processing. A completely new service – the
DataSourceDetector allows applications to provide
suggestions to users about the data sources on their
desktops. A host of bugfixes and minor improvements rounds
the image of the leanest and meanest version of Aperture
ever made. Enjoy.

What’s new?

  • a completely new Aperture service – the
    DataSourceDetectors – can be used to provide advice to
    the user about the data sources on the desktop
  • new subcrawlers for .zip, .gzip, tar and bzip2 compressed
  • unification of the email handling – now the ImapCrawler,
    MboxCrawler and the MimeSubCrawler use the same code in
    the DataObjectFactory to convert emails to RDF. The
    MimeExtractor has been deprecated, switch to
  • some bugfixes in the email handling code, plain text, and
    xml attachments are treated correctly, threads are
    reflected in the resulting rdf
  • the pdf extractor has some basic support for XMP metadata
    (thanks to JempBox)
  • a completely new XmlSafetyUtil class that helps to deal
    with characters that are valid in RDF, but invalid in XML
    thus breaking the serialization
  • the uris of subcrawled resources follow the pattern
    established by the Apache Commons VFS project.
  • new Sesame 2.2.1 bundled with Aperture features dramatic
    performance optimizations, e.g. the aperture test suite
    is 2 times faster, this may also be a boost for your

Best regards
Leo Sauermann
Christiaan Fluit
Antoni Mylka

Free Semantic Web Trendseminar, 5.11.2008, Stuttgart

Mittwoch, 05. November 2008
09:30 bis 14:00 Uhr
Haus der Wirtschaft Baden-W├╝rttemberg, Stuttgart

“Semantic Technologies have a high potential for Enterprises” …. The event is in German, nice speakers. Tassilo Pellegrini from is probably talking about their company experiences, I would guess we hear some nice war stories here.

Semantic-Web blogs about our Semantic Desktop talk

Jana Herwig from the team blogged about Brian Davis and my talk at the practical semantic web workshop.

The next session at WOD-PD was given by Leo Sauermann (German Research Center for Artificial Intelligence DFKI, Germany), and Brian Davis (DERI Galway, Ireland). Leo introduced the idea of the Semantic Desktop, and more specifically, the Nepomuk Social Semantic Desktop.


Malaysia first web 3.0 country

This was just forwarded to me by Andreas Dengel:
“A Malaysian government applied research agency wants the country to be recognized as the first Web 3.0 society in the world.
MIMOS is driving the semantic technology industry in the nation towards this goal.”

Read on at

“Semantic technology is driving the next generation of the Web, the semantic Web, a machine-readable Web of intelligent data and automated services that amplify the Web far beyond its current capabilities,” said Dato’ Wahab, at the agency’s Semantics Symposium in Malaysia.

A problem of semantic web: not providing XML value

Today in the morning, I had a sudden “insight” about one of the problems of RDF and Semantic Web: it misses some of the value that XML offers. This is what my daily commuting bike ride is for, thinking…

The adoption of Semantic Web rises and falls with the adoption of it in standardization bodies. For example the Oil&Gas industry of Norway is thinking about Semantic Web, and I have recently been talking with people from the automotive supplier industry about Semantic Web. To interchange data in a business-to-business environment you would expect that RDF has more features than XML, but in fact, it doesnt.

  • RDF is less expressive than XML. One example: you can’t define pattern in RDF. Look in the XML spec, there is much more of it missing in RDF.
  • RDF is not validated. Although in theory, it is possible to validate a file for semantic correctness, nobody does that because of the open world assumption. Hence, there is no validation of the XML in mainstream applications.

So, if you are an industry, you already havean XML based standard, moving to RDF without the expressiveness of XML and without the notion of validation is tricky. RDF should have more features, not less.

Also the stack of XML technologies must be embraced better, for example a XSLT-friendly RDF/XML serialization. Please, dear reader, solve these problems and make a company around it.

Springer’s “Social Semantic Web” is out – kauf mich.

On Friday we received our author copies of Springers new masterpiece of Semantic Web books. Springer’s “Social Semantic Web” – Web 2.0 – was nun”

Social Semantic Web Happy Authors

From left to right: M, Kinga Schumacher, Ansgar Bernardi, Leo Sauermann. We all were authors on the chapter on “Semantic Desktop”, Malte additionally contributed to the chapter on Semantic Wikis. As you see we are very happy about our complimentary author’s copy. Besides that – das buch ist gut.

Read it – in deutsch ­čÖé . The authors (besides our small contribution) are the who-is-who of the German-speaking Semantic Web. Chris Bizer, S├Âren Auer, Sebastian Schaffert, Kr├Âtsch&Vrandecic, etc …

edited by Andreas Blumauer and Tassilo Pellegrini, it brings together many authors from the practical side.