microsoft manager: “needed a robust and scalable RDF repository” –

In an easily overlooked Press release by Intellidimension, an interesting statement about RDF is done.

Read the interesting part first:
“We chose to integrate the Intellidimension solution based on the strength of its technology and the company’s semantic web expertise,” said John Deutscher, group program manager of Industry Solutions at Microsoft Corp. “We needed a robust and scalable RDF repository with tight integration into SQL Server 2005 to meet our goals for housing extensible metadata in Interactive Media Manager, and RDF Gateway addressed our requirements.”

intellidimension logoWell done! Managing media with this setup is probably very clever.
Intellidimension is the company that does RDFGateway, a compact, good-performaning RDF application building platform (store + gui).

The whole press release:
Brattleboro, VT, USA, May 21, 2007 – Intellidimension announces that Microsoft has integrated Intellidimension RDF Gateway in their new product, Microsoft Interactive Media Manager. Interactive Media Manager is a server-based solution which enables organizations to better manage the complexity inherent in the digital content lifecycle, from content creation to management to distribution. The solution, an extension of Microsoft Office SharePoint Server 2007, is specifically designed for organizations in the media and entertainment industry, including film, television & postproduction studios, broadcasters, game developers, publishers and advertising and marketing agencies.

We’re excited about our work with Microsoft, which combines Intellidimension technologies with their recently introduced Interactive Media Manager solution,” said co-founder and CEO of Intellidimension Geoff Chappell. “It’s inline with our business goals and is an important vote of confidence in our technology. It’s also a great sign for the semantic web in general that a technology leader such as Microsoft is embracing semantic technologies.”

As part of this integration, Intellidimension enhanced RDF Gateway to support Microsoft SQL Server 2005 as a storage engine for its RDF deductive database. This integration with SQL Server will be available widely with the next release of RDF Gateway, providing Intellidimension enterprise users all of the capabilities of RDF Gateway with the added reliability and manageability of SQL Server for their mission critical data.

“We chose to integrate the Intellidimension solution based on the strength of its technology and the company’s semantic web expertise,” said John Deutscher, group program manager of Industry Solutions at Microsoft Corp. “We needed a robust and scalable RDF repository with tight integration into SQL Server 2005 to meet our goals for housing extensible metadata in Interactive Media Manager, and RDF Gateway addressed our requirements.”

About RDF Gateway

RDF Gateway is a high-performance, scalable semantic web server with a RDF deductive database at its core. Its dynamic scripting environment, rules-based inference, and host of other powerful features enable rapid development and deployment of semantic web applications. Its scalable architecture and proven performance keep applications running as data requirements grow.

About Intellidimension

Intellidimension, founded in 2000, provides solutions to satisfy today’s demands for richer information. The company supports the vision of a Semantic Web in which information is easily shared and re-purposed, and where a new generation of tools offer increased information automation by harnessing the power of networked, intelligent data. Its technologies enable the development of practical applications incorporating this vision.

Thanks to Intellidimensions CEO Geoff Chappell to point me to the press release.

Now Queryable and open linked data: U.S. Census/Congress datasets: 1 billion triples

and its fast!

As you can’t blog enough about it, I am copying a story from this announcement email:
(following Text by Josh Tauberer)

Hi, everyone. (This is a revised/combined reannouncement for what was
originally posted on the Linking Open Data list.)

Last November, Chris Bizer wrote, “[T]he DBLP server increases the size
of the Semantic Web by around 10 percent ;-)” [1] Based on the same
logic, I have recently increased the size of the semantic web by 200%!
(in terms of the number of triples; and of course I’m also just joking
here w.r.t. size of the semantic web)

I’m announcing here a new U.S. 2000 Census dataset of 1 billion triples,
accessible over SPARQL and browsable by linked data [2] principles, and
re-announcing my U.S. Congress dataset which is newly browsable with
linked data principles. These two datasets are interconnected, and the
Census dataset is linked up via owl:sameAs to Geonames [3].

I like the Census data set a lot for three reasons— first, if you live
in the U.S. it has something for you, since it has detailed statistics
on geographic entities down to the level of small towns/villages, and
everyone lives somewhere; second, it meshes up with two other data sets;
and third, it’s rich enough on its own to support a wide array of
interesting and real-world useful queries (if, say, you were doing
research).

The OpenLink guys were kind enough to host the data set previously, but
I wanted to push the limits of my own semweb C# library [4] and I wanted
to be able to revise the data set as needed, so I’ve wanted to host it
myself, which only recently I was able to do (even though I’ve had the
triples laying around for nearly a year).

A complete description of the data set and how it was constructed and
exposed is here:

http://www.rdfabout.com/demo/census/

Some features of the data set:

Data on 3,200 U.S. counties, 36,000 “towns”, 16,000 “villages”, 33,000
ZCTAs (something like zip-codes), and 435 congressional districts.

Each of those locations contains around 10 thousand population
statistics, as well as a dc:title, a basic hierarchical structure
between regions, and latitude/longitude.

Very basic geographic/name/lat-lng data (1 million triples) can be
downloaded in N3.

All of the 1 billion triples are accessible via SPARQL. See:
http://www.rdfabout.com/demo/census/sparql.xpd which has a few sample
queries. An example query is “List the states in the United States that
have more students in dorms than prisoners.”

The URIs for the geographic regions are dereferencable http: URIs. (The
URIs for the predicates in the data set will be updated to be
dereferencable in the future.) For example, you can visit the URI for
New York State:

http://www.rdfabout.com/rdf/usgov/geo/us/ny

(Some URIs return very large pages that take Firefox quite a while to
render. That one’s OK.)

The dereferencable URIs return 303’s to SPARQL DESCRIBE pages describing
those URIs.

There is a sitemap.xml file based on the latest draft circulated [5],
referenced from robots.txt: http://rdfabout.com/robots.txt

And, source code to generate the triples from the Census download files
are posted. It’s too large for me to provide the whole RDF myself, for
now at least.

The U.S. Congress data set, which I originally made SPARQL-accessible in
December 2005 but is now revised to follow the new linked data
principles, has 12 million triples containing brief biographical data
for all members of Congress, and mainly data for federal legislation and
voting records going back a number of years. Here are two example
dereferencable URIs:

http://www.rdfabout.com/rdf/usgov/congress/people/M000303
(= Senator John McCain)

http://www.rdfabout.com/rdf/usgov/congress/109/bills/h867
(= a bill in Congress)

Some example Congress-related queries are posted here:
http://www.govtrack.us/sparql.xpd
And dump files are here:
http://www.govtrack.us/data/rdf/

An example I like to use is that one could fairly easily create a table
using SPARQL aligning votes on a particular bill by congressmen with,
for instance, the median commuting time to work of their constituents,
as reported by the Census.

Thanks to those who gave feedback on the LOD list — I haven’t been
able to address all of it yet (like how to deal with backlinks on the
dereferenced pages).

[1] http://lists.w3.org/Archives/Public/semantic-web/2006Nov/0008.html
[2] http://linkeddata.org/
[3] http://www.geonames.org/
[4] http://razor.occams.info/code/semweb
[5] http://sw.deri.org/2007/07/sitemapextension/


– Josh Tauberer

http://razor.occams.info

Foaf VS Walled Communities

The web 2.0 is getting richer based on the proprietary walls.
There is an often asked question: why is FOAF out there for years but nobody made a business model out of it? Easy: because when you open your service to be replaceable, you are replaceable. Imagine a world where you could switch your community website like you can already switch your newsreader today (thanks to OPML): Venture capitalists would sweat like a drunken grad student in final exam.

This post is partly inspired by this. There is a mix up of cause and effect: FOAF as a standard is making Web 2.0 services interoperable and standardized. Orkut, facebook, linkedin, studivz, (++) would all have the same API and an extensible data format if they used FOAF and RDF.

But wait – who is the venture capitalist behind these web 2.0 walls? Maybe you. So you suddenly realize that when your little nerds in the computer room switch the lever towards standards, your money may go down the well, because then your precious closed community of people, and that is what you sell and own: data about people, will be open for anyone else to copy. So you would do your best not going for standards but instead making the BEST social service EVER so that everyone DIGGS it and invites all HIS FRIENDS into closed walls. Capitalism is ok, but we have to name it what it is.

Like Dick Hardt from SXIP said in his well-known keynote: its your data, not theirs. FOAF and RDF is a way to get back your data from the web 2.0 companies that own you at the moment, make them give you back the data you have entered. Switching from Flickr to another photo community should be as easy as switching newsreader (thx to OPML) or Office Application (thx to OpenDocument).

Microsoft does not go crazy for a standardized OpenDocument format, which would help us get free from the monopoly that takes your money when buying a computer and invests it to sell XBoxes to your kid. Why should it be different with web 2.0? But thanks for pointing us to the VC view.

Doing Guest Research at L3S

Update from last post, I am in Hannover engaged in Research. Julien Gaugaz from L3S and I are working on the topic of Current work Context of a user for Nepomuk. This is tricky, as first of all, there are about 600 definitions of “context” (there is a paper about analyzing 66 of them using text analytics, context buzzword word that everyone uses with its own interpretation!) and second, we have multiple existing ontologies and implementations to integrate.

btw, this is the room for the Hiwis and guest researchers where I sit at the moment:
Here I sit
I will blog about our extensive literature research on context later this week.

In the meantime, Cedric Mesnage from Lugano interviewed all DFKI and L3S members about what they are working on in the project, to improve the architecture of Nepomuk and communicate it better, a good task.

We drank beer and looked at the rainy (no, its not raining, its only dripping, Hannoverians have 100 words for rain) city of Hannover.
To Heiko Haller I said in a chat, that we will probably end up in a Fiasko, but actually we started there:
Es fängt im Fiasko an

Besides, Cedric can not only do great art, he can also do photographs (bummer that the autofocus didn’t get this one perfect):
Leobard Light Angel

Research visit at L3S

I am currently guest researching at the L3S lab in Hannover. L3S does research on Semantic Web, e-learning, search. They are partner in the NEPOMUK project, and I will be working on the ideas of the context a user currently works in, to guess what somebody is doing based on observations done. This is important to give support to people, showing them information that is related to the task you are doing at the moment. (for example: you are reading this blog-post, and related information is l3s, dfki, nepomuk, leo sauermann, context).

I will be at L3S for three weeks, from 24.9.2007 to probably 14.10.2007.

Here is the building where I am sitting at the moment: KBS building with L3S people

Where I sitIts the L3S offices at KBS, Appelstraße 4, 30167 Hannover. I am sitting in room 232. If you want to use this occassion to meet me, skype or jabber me first (check my foaf for how to).

Appartments from Gaststätte Kaiser I am living not far from that in an appartment room, where I have access to a kitchen and a bed, and free wifi for 90 minutes per day, great, what else do you need.

Got Schmapped

Schmap.com is a website providing dynamic travel guides. Instead of using boring stock photos, they also use flickr photos to illustrate their guides. I was contacted by them if they could use one of my photos for their guide (thanks to CC this all works).
Update (27.9.2007): As the photo wasn’t CC licensed (Thanks Maggi for pointing it out), I have to add that they asked me if they could do it. Hm, I wonder if they look for good pictures or for CC pictures.

Now, their guide is updated and here is the section where they use my photo (taken during ISWC 2005, of the conference dinner 🙂

They have a schmaplet to embed the post in blog-posts, but it seems that my blog-provider doesnt allow i-frames, so
click yourself.

This is the way to go: web 2.0 business!

Cyberspace “wonderland” by sun

SUN researcher Nicole Yankelovich presents an open-source 3-dimensional virtual environment, that can do many tricks, including application sharing in a like-VNC way.

Its called project wonderland, and builds on darkstar, an open source gaming server platform. Underneath, there is looking glass, an open source 3d platform.

Its a business-oriented, knowledge-work oriented environment, published open source. Unlike secondlife, which is closed-source and leisure-oriented.

With systems like this and all data represented as RDF on a Semantic Web, using Semantic Desktop applications, users could intuitively place their work in a virtual environment and share their ideas. Looking forward to see how this integrates with Semantic Web.

thanks to Henry Story for the link and his story on wonderland.

Microsoft Astoria on RDF: “no use” and pushing own format

Mike Bergman blogged about a new release of Microsoft Astoria, where they announce support for Atom, JSON (who-hoo), and Web3s.

I doubt that this can be taken as a signal from the whole Microsoft Corporation but is rather the view of project members of this one project. Nevertheless, its going to be interesting what Microsoft will do: bunker themselves in (terran style) and hope that their customers will never hear of open standards? I doubt, and therefore I do not fully agree with Mike:
This is yet another stunning and lame attempt by Microsoft to replace open standards with proprietary ones. Get a clue, Redmond!

They probably got their clue already, but for big companies it always takes time… HP (Jena), IBM (SLRP) and SUN are faster this time.

Triple-I 2007 Wrapup

As blogged before, I attended the Triple-I conference 2007.

triple-i logo

Summing it up:

  • There are pictures online from me, bblfish (Henry Story), and others.
  • The four Keynotes were interesting, and gave insights both from scientific view (innovation, trends)
  • as from the corporate view (buzzwords, trends, where to invest)
  • DFKI contribute quite much in the field of Knowledge Management, we had several papers published on: the Nepomuk Social Semantic Desktop (with many co-authors), the PIMO personal information model, the ConTag tagging system, Philosophy and Cognitive aspects of Semantic Desktops, and E-Learning
  • Martin Memmel also blogged about his trip to Graz

Papers were I contributed were:

Keynotes:
 Keynote Marc Smith demoing Vista "Tagging Game"

The Triple-I is balanced between science, technology fair with customer contacts, and social networking event. For example, enjoy the social event:

Social Event

Klaus Tochtermann and the Know-Center are the main organizers, the newly re-branded Semantic Web Company assisted, as did Salzburg Research. Hermann Maurer was omnipresent, as scientific father of many innovations.

Goodbye Handshake

Here is a short summary of the conference, extending my previous post.
Triple-I was this year not-so-organized during the paper submission phase (they changed paper length once) but on-site, its very well organized. The keynotes are at 09:30, leaving enough time to get your morning coffee. Between the sessions are long 1h breaks to knit your social network. I met many old and new aquaintances and friends. Fine conference.

At the evening of the first day, there was a welcome reception at Minoritensaal.

Second day I presented a haptic personal semantic network, a birthday present from my university friends:

In the afternoon of Thursday, the second day, the FFG and the Innovation Relay Centre Austria organised an International Cooperation Event. I represented DFKI at this Event and revived contacts with old partners and found new possible partners, alltogether a speed dating with 10 people, 20 minutes each. Exhausting, but worth it.

Friday Evening, Andreas Blumauer organized a dinner I was very happy to attend.

Andreas Blumauer:
Andreas Blumauer

Here is what they say about themselves:
The TRIPLE-I Conference series is a joint venture of the conferences I-KNOW, I-MEDIA and I-SEMANTICS. TRIPLE-I reflects the increasing importance and convergence of knowledge management, new media technologies and semantic systems. This unique concept aims at bridging the gaps between the various communities and their technology fields.

Building upon the reputation of the premium conference series I-KNOW and SEMANTICS the upcoming event addresses representatives from academia, industry and public administrations. TRIPLE-I will offer its participants a unique platform either to present latest and leading edge developments or to catch up with the developments of most innovative IT technologies, content applications, business models and emerging market opportunities.