2011 is almost over and more than 9 months of my PhD have already passed by. During my math diploma I was founded by the german national academic foundation. Besides some really nice benefits that came along with this every 6 months I was forced to write reports about my study progress. Even though theses …
Tag: Wikipedia
Download Google n gram data set and neo4j source code for storing it
In the end of September I discovered an amazing data set which is provided by Google! It is called the Google n gram data set. Even thogh the english wikipedia article about ngrams needs some clen up it explains nicely what an ngram is. http://en.wikipedia.org/wiki/N-gram The data set is available in several languages and I …
Graphity: An efficient Graph Model for Retrieving the Top-k News Feeds for users in social networks
UPDATE: the paper got accepted at SOCIALCOM2012 and the source code and data sets are online especially the source code of the graphity server software is now online! UPDATE II: Download the paper (11 Pages from Social Com 2012 with Co Authors: Thomas Gottron, Jonas Kunze, Ansgar Scherp and Steffen Staab) and the slides I already said …
3 exercises for Sorting problems (Quicksort , Mergesort) in Algorithms and data structures class
#1: Sorting huge files Sorting big files might not be as simple as just implementing an sort algorithm. As soon as the file does not fit in memory any more smarter implementations have to be applied. One way is to sort the file on the hard disk. We remark that not every algorithm is easily …
3 exercises for Sorting problems (Quicksort , Mergesort) in Algorithms and data structures class
#1: Sorting huge files Sorting big files might not be as simple as just implementing an sort algorithm. As soon as the file does not fit in memory any more smarter implementations have to be applied. One way is to sort the file on the hard disk. We remark that not every algorithm is easily …
neo4j based social news feed demo on wikipedia graph running
UPDATE: you can find an evaluation of the following blog post and idea on: http://www.rene-pickhardt.de/graphity-an-efficient-graph-model-for-retrieving-the-top-k-news-feeds-for-users-in-social-networks/ Hey everyone I can finally demonstrate the neo4j and gwt system that I have been blogging about over the last weeks here and here. But please find the demo under the following adress: http://gwt.metalcon.de/GWT-Modelling The code will be available soon! …
Will Wikipedia replace scientific journals?
We all know that the review process of scientific journals is a good thing that served a certain purpose in the time it was created. In this video Jimmy Wales – well known as the Wikipedia founder – talks about the impact of new technologies and the web on the traditional concept of scientific journals …
Business Model of Metaweb with freebase
Today I want to talk about one of my favourite Internet start ups. It is called Metaweb. Metaweb was founded in 2005. To my knowledge it had two rounds of investment (all together about 60 Million US Dollar) and it was bought by Google in June 2010. So why do I love metaweb? Well they …
Wikipedia – UNESCO World Culture Heritage
This morning I received an email telling me about wikipedias plans to become an UNESCO world culture heritage. I right away was amazed by this idea! Think about it! Wikipedia is one of the greatest projects that came along in recent years. Education and knowledge are among the most important things in our society and …
How to download Wikipedia
Wikipedia is an amazing data set to do all different kinds of research which will go far beyond text mining. The best thing about Wikipedia is that it is licensed under creative common license. So you are allowed to download Wikipedia and use it in any way you want. The articles have almost no spelling …