More than a year ago I started thinking about indexing scored strings for autocompletion queries. I stumbled upon this problem after seeing how strong the predictions of the Typology approach were for next-word prediction on smartphones. The Typology approach had one major drawback: though its suggestions had high precision, the speed …
Tag: Information retrieval
Typology Oberseminar talk and Speed up of retrieval by a factor of 1000
Almost two months ago I gave a talk about Typology in our Oberseminar. Update: Download slides. Most readers of my blog will already know the project, which was initially implemented by my students Till and Paul. I am just about to share some slides with you. They explain on one hand how the system works and on …
Building an Autocompletion on GWT with RPC, ContextListener and a Suggest Tree: Part 0
Over the last few weeks I had quite some quality programming time. First of all I built some indices on the Typology database, which allowed me to increase the retrieval speed of Typology by a factor of over 1000, something that rarely happens in computer science. I will …
Foundations of statistical natural language processing Review of chapter 1
Due to the interesting results we found by creating Typology, I am currently reading the related work on query prediction and autocompletion of sentences. There is quite some interesting academic work available in this area of information retrieval. While reading these papers I realized that I am not that strong in the field of …
Typology using neo4j wins 2 awards at the German federal competition young scientists.
Two days ago I arrived in Erfurt to visit the federal young scientists competition (Jugend Forscht). I have reported before on the Typology project by Till Speicher and Paul Wagner, which I supervised over the last half year and which has already won many awards. On Saturday night they already won a special award donated by …
Smartphones of Policemen could give criminals a competitive advantage
If I were a criminal I would create a smartphone app that would give me the possibility to track policemen geographically and socially. Here is some background on this thought. Yesterday I was sitting in the German summit on “Facebook, Google & Co – Chances and Risks” (which I will blog about soon). But today …
PhD proposal on distributed graph data bases
Over the last week we had our off-campus meeting with a lot of communication training (very good and fruitful) as well as a special treatment for some PhD students called “massage your diss”. I was one of the lucky students who were able to discuss their research ideas with a postdoc and other …
Paul Wagner and Till Speicher won State Competition "Jugend Forscht Hessen" and best Project award using neo4j
Six months of hard coding, supervised by me, are over and have ended in a huge success! After analyzing 80 GB of Google n-gram data, Paul and Till loaded it into a neo4j graph database in order to make predictions for fast sentence completion. Today was the award ceremony and the two students from …
Google Video on Search Quality Meeting: Spelling for Long Queries by Lars Hellsten
Amazing! Today I had a discussion with a coworker about transparency and how companies should be more open about what they are doing. And what happens on the same day? One of my favourite web companies has decided to publish a short video taken from its weekly search quality meeting! The proposed change by Lars …
Related-work.net – Product Requirement Document released!
Recently I visited my friend Heinrich Hartmann in Oxford. We talked about various issues with how research is done these days, and about how the web could, in theory, help spread information faster and connect people interested in the same papers or topics more efficiently. The idea of http://www.related-work.net was born. A scientific platform which is …