[tor-reports] GSoC: Weekly report for ahmia, week 26

Juha Nurmi juha.nurmi at ahmia.fi
Fri Jun 27 15:46:27 UTC 2014


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi all,

In this week:

Currently, ahmia.fi uses YaCy software that is P2P web crawler with
many features. There is one extremely good reason to use YaCy: it
could be scaled effectively if we would publish easy installation
script or ready-made VM machine or tutorial to join our voluntary
search network. After this we could crawl all the time and keep the
search updated.

* I am planning public open YaCy back-end for everyone:
https://github.com/juhanurmi/ahmia/issues/14 but this
http://forum.yacy-websuche.de/viewtopic.php?f=8&t=4845 'feature'
results that there has to be VPN-meshnet between YaCy nodes now :( My
good friend Mikko (kordex) has configured YaCy nodes and VPNs with
cfengine. He is the expert with YaCy and will look this when he has time.

* I Looked through some other free crawlers too. Heritrix web crawler
may be a good alternative to YaCy if one does not need P2P properties.
Still, I personally prefer building free open voluntary search network
with YaCy.

* I chatted with Chris MacNaughton who has built the
https://torsearch.es/ and is now selling the site. TorSearch is
crawling with Apache Nutch using HBase and saves the pages to Solr.
Crawling takes weeks with powerful server. The future of TorSearch is
unknown.

* I have been developing a better search interface that uses many
search properties provided by YaCy, check out the prototype
http://msydqstlz2kzerdg.onion/yacysearch.html

* Submited Google student midterm evaluation. Started using atom
editor, it is good btw. Prepared myself to the tor-dev meeting in Paris :)

Have a nice weekend everyone!

Greetings,
- -Juha
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1
Comment: Using GnuPG with Thunderbird - http://www.enigmail.net/

iQEcBAEBAgAGBQJTrZHIAAoJELGTs54GL8vA9KMH/R0vreeLogDm225yE5CwqpX/
56vhT1Jdc3VT5B0dWMmmOv2kg5Q3t5swkQWEne3hQwHZ6zOBHibwBn+Zvg3Mvx0l
AK1J2PEXx/StspYnVjNXNJec4naskoELcpkdSe2M6GZQWLmp9F+DwNn1c+BNS5UB
mjlUYQg1VbHQfEnYZ1KfS4gzSw6KIP1yYfwNGJkUC+esUPjXNECC7ZF4q5fVrYWp
dL4pMfbgqkBZKOYDwsffMqFc0BHcyHkLAGyHFRmv/FAP0fgzX+TCZSAd7vbE9p0W
6nJ+lxY51UiGZcLHUYgYqMUYc25rx99tEtFmoLwUTlp0N1H2neoAY5dVA5ePOo4=
=FIti
-----END PGP SIGNATURE-----


More information about the tor-reports mailing list