[tor-reports] GSoC: Weekly report for ahmia, week 26
juha.nurmi at ahmia.fi
Fri Jun 27 15:46:27 UTC 2014
-----BEGIN PGP SIGNED MESSAGE-----
This week:
Currently, ahmia.fi uses YaCy, a P2P web crawler with many features.
There is one extremely good reason to use YaCy: it could scale
effectively if we published an easy installation script, a ready-made
VM image, or a tutorial for joining our volunteer search network.
After this we could crawl all the time and keep the
* I am planning a public, open YaCy back-end for everyone:
https://github.com/juhanurmi/ahmia/issues/14 but this
means that there has to be a VPN meshnet between the YaCy nodes now :( My
good friend Mikko (kordex) has configured YaCy nodes and VPNs with
cfengine. He is the expert on YaCy and will look into this when he has time.
* I looked through some other free crawlers too. The Heritrix web crawler
may be a good alternative to YaCy if one does not need P2P properties.
Still, I personally prefer building a free, open, volunteer search network
* I chatted with Chris MacNaughton, who built
https://torsearch.es/ and is now selling the site. TorSearch
crawls with Apache Nutch using HBase and saves the pages to Solr.
Crawling takes weeks with a powerful server. The future of TorSearch is
* I have been developing a better search interface that uses many
search properties provided by YaCy; check out the prototype
* Submitted the Google student midterm evaluation. Started using the Atom
editor; it is good, btw. Prepared for the tor-dev meeting in Paris :)
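
For the search interface item above, here is a minimal sketch of how a
front end could query a YaCy node and read back the hits. It is not the
ahmia code. It assumes YaCy's JSON search endpoint (yacysearch.json with
the query, startRecord and maximumRecords parameters) and a response
shaped as channels containing items; check both against the YaCy version
you run. The function names, the localhost address and the sample payload
are hypothetical and only for illustration.

```python
import json
from urllib.parse import urlencode


def build_search_url(base_url, query, start=0, count=10):
    """Build a search URL for a YaCy node (endpoint name is an assumption)."""
    params = {"query": query, "startRecord": start, "maximumRecords": count}
    return base_url.rstrip("/") + "/yacysearch.json?" + urlencode(params)


def parse_results(payload):
    """Extract (title, link) pairs from a JSON response string.

    Assumes the layout {"channels": [{"items": [...]}]}; missing keys
    simply yield an empty result list.
    """
    data = json.loads(payload)
    results = []
    for channel in data.get("channels", []):
        for item in channel.get("items", []):
            results.append((item.get("title", ""), item.get("link", "")))
    return results


# Hypothetical sample response, used instead of a live node.
SAMPLE = (
    '{"channels": [{"items": '
    '[{"title": "Example", "link": "http://example.onion/"}]}]}'
)

if __name__ == "__main__":
    print(build_search_url("http://localhost:8090", "hidden wiki"))
    print(parse_results(SAMPLE))
```

Parsing is kept separate from fetching so the interface logic can be
tested against saved responses without a running node.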
Have a nice weekend everyone!
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/
-----END PGP SIGNATURE-----