As I said yesterday I have been busy over the last months producing content so here you go. For related work we are most likely to use neo4j as core data base. This makes sense since we are basically building some kind of a social network. Most queries that we need to answer while offering …
Tag: data
Graphity source code and wikipedia raw data is online (neo4j based social news stream framework)
UPDATE: there is now the source code of an entire graphity server application online! 8 months ago I posted the results of my research about fast retrieval of social news feeds and in particular my graph index graphity. The index is able to serve more than 12 thousand personalized social news streams per second in …
Download Google n gram data set and neo4j source code for storing it
In the end of September I discovered an amazing data set which is provided by Google! It is called the Google n gram data set. Even thogh the english wikipedia article about ngrams needs some clen up it explains nicely what an ngram is. http://en.wikipedia.org/wiki/N-gram The data set is available in several languages and I …
Graphity: An efficient Graph Model for Retrieving the Top-k News Feeds for users in social networks
UPDATE: the paper got accepted at SOCIALCOM2012 and the source code and data sets are online especially the source code of the graphity server software is now online! UPDATE II: Download the paper (11 Pages from Social Com 2012 with Co Authors: Thomas Gottron, Jonas Kunze, Ansgar Scherp and Steffen Staab) and the slides I already said …
Download your open source Youtube insights statistics tool
Today I have figured out that you can download a lot of statistics about your on videos from Youtube. That is actually very nice since I was always sceptical that you miss the knowledge of who is watching your videos once you do not host them yourself. Unfortunately there are some drawbacks to these statistics: Youtube …
My Blog guesses your name – Binary Search Exercise for Algorithms and data structures class
Binary Search http://en.wikipedia.org/wiki/Binary_search_algorithm is a very basic algorithm in computer science. Despite this fact it is also important to understand the fundamental principle behind it. Unfortunately the algorithm is tought so early and the algorithm is so simple that beginning students sometimes have a hard time to understand the abstract principle behind it. Also many …
Download network graph data sets from Konect – the koblenz network colection
UPDATE: now with link to the PhD thesis. By the time of blogging the thesis was not published. thanks to Patrick Durusau for pointing out the missing link. One of the first things I did @ my Institute when starting my PhD program was reading the PhD thesis of Jérôme Kunegis. For a mathematician a …
Business Model of Metaweb with freebase
Today I want to talk about one of my favourite Internet start ups. It is called Metaweb. Metaweb was founded in 2005. To my knowledge it had two rounds of investment (all together about 60 Million US Dollar) and it was bought by Google in June 2010. So why do I love metaweb? Well they …
Google, Facebook & co. are not free!
Besides ecommerce one really big Internet business model obviously is the trade or monetarization of data. This Blogpost reminded to write an article about this topic. I think the author points out the most important facts! Services like Facebook / gmail / Youtube and so on are not free. Moneywise they are free of charge but …
Where to upload your music – The perfect Band website: part 5
While writing my article on link baiting for musicians I realized that this topic is overdue. besides the fact that Facebook is highly overrated this is probably the most valueable piece of advice I wrote in my blog so far. Quick answer: “where to upload your music”: You upload all of your music to the …