Two days ago I arrived in Erfurt in order to visit the federal competition young scientists (Jugend Forscht). I reported about the project typology by Till Speicher and Paul Wagner which I supervised over the last half year and which already won many awards. Saturday night they have already won a special award donated by …
Category: Data
Smartphones of Policemen could give criminals a competitive advantage
If I were a criminal I would create a smart phone app which would give me the possability to geographically and socially track policemen. Here some background on this thought. Yesterday I was sitting in the German summit on “Facebook Goolgle & Co – Chances and Risks” (which I will blog about soon) But today …
PhD proposal on distributed graph data bases
Over the last week we had our off campus meeting with a lot of communication training (very good and fruitful) as well as a special treatment for some PhD students called “massage your diss”. I was one of the lucky students who were able to discuss our research ideas with a post doc and other …
Wishlist of features for a distributed graph data base technology
I am just dreaming this does not exist and needs to be refined in a later stage. Fast traversals: Jumping from one vertex of the graph to another should be possible in O(1) Online processing: “Standard queries” (<–whatever this means) should compute within miliseconds. As an example: Local recommendations e.g. similar users in a bipartite …
Download Google n gram data set and neo4j source code for storing it
In the end of September I discovered an amazing data set which is provided by Google! It is called the Google n gram data set. Even thogh the english wikipedia article about ngrams needs some clen up it explains nicely what an ngram is. http://en.wikipedia.org/wiki/N-gram The data set is available in several languages and I …
Download network graph data sets from Konect – the koblenz network colection
UPDATE: now with link to the PhD thesis. By the time of blogging the thesis was not published. thanks to Patrick Durusau for pointing out the missing link. One of the first things I did @ my Institute when starting my PhD program was reading the PhD thesis of Jérôme Kunegis. For a mathematician a …
Amazed by neo4j, gwt and my apache tomcat webserver
edit: the demo is finally online but on a different data set though: check out the demo and read about the new data set. An evaluation of graphity can be found here Besides reading papers I am currently implementing the infrastructure of my social news stream for the new metalcon version. For the very first …
Download Trec (= Text Retrieval Conference) Data Set
Being back in University I get to see more and more data sets. Origninally I wanted to use the data sets category of my blog to provide an unordered list of these publicly available data sets sort of as a personal reminder. For some reason I never really did that but I am now about …
Social news streams and time indices on graphs for social networks
Last week I had a meeting with my PhD advisor and we talked about my ideas on social news streams and how you could implement them using graph data bases. Of course my choise here would be neo4j. Yesterday I had my first talk in our “Oberseminar” which is a weekly meeting of all PhD …
Risks and criticisms of Google's Data liberation front
I have wanted to write about Google’s data liberation front from some time. The data liberation front is an effort by Google to make your data from Google products available to you once you decide not to use them any more. Today my former student Martin was faster in his blog and created an excelent …