Comments on: How to download Wikipedia https://www.rene-pickhardt.de/how-to-download-wikipedia/ Extract knowledge from your data and be ahead of your competition Tue, 17 Jul 2018 11:07:57 +0000 hourly 1 https://wordpress.org/?v=4.9.6 By: Fredrick Otieno https://www.rene-pickhardt.de/how-to-download-wikipedia/#comment-17008 Tue, 21 Jan 2014 13:14:37 +0000 http://www.rene-pickhardt.de/?p=249#comment-17008 Hi Rene
your post is very insightful it’s awesome, but i went about it a slightly different way…and i think a bit easier.. i used the wikitaxi to host the Wikipedia dump file. i donwloaded the dumnp file and the wikitaxi software as a torrent file first. you can opt to use the kiwix software too.. i hope that helps

]]>
By: Fredrick Otieno https://www.rene-pickhardt.de/how-to-download-wikipedia/#comment-17010 Tue, 21 Jan 2014 13:14:37 +0000 http://www.rene-pickhardt.de/?p=249#comment-17010 Hi Rene
your post is very insightful it’s awesome, but i went about it a slightly different way…and i think a bit easier.. i used the wikitaxi to host the Wikipedia dump file. i donwloaded the dumnp file and the wikitaxi software as a torrent file first. you can opt to use the kiwix software too.. i hope that helps

]]>
By: Download Google n gram data set and neo4j source code for storing it https://www.rene-pickhardt.de/how-to-download-wikipedia/#comment-17001 Sun, 27 Nov 2011 13:28:31 +0000 http://www.rene-pickhardt.de/?p=249#comment-17001 […] to learn those weights. As a training data set a corpus from different domains could be used (e.g. wikipedia corpus as a general purpose corpus or a corpus of a certain domain for a special […]

]]>
By: Download Google n gram data set and neo4j source code for storing it https://www.rene-pickhardt.de/how-to-download-wikipedia/#comment-17004 Sun, 27 Nov 2011 13:28:31 +0000 http://www.rene-pickhardt.de/?p=249#comment-17004 […] to learn those weights. As a training data set a corpus from different domains could be used (e.g. wikipedia corpus as a general purpose corpus or a corpus of a certain domain for a special […]

]]>
By: René G https://www.rene-pickhardt.de/how-to-download-wikipedia/#comment-17000 Wed, 16 Feb 2011 20:53:11 +0000 http://www.rene-pickhardt.de/?p=249#comment-17000 Interesting chain of trial and error. For some people maybe obvious, but nevertheless useful information.
More interesting: I’m eager to see the results you get out of the data set 🙂

]]>