[tor-commits] [webwml/staging] Add Ahmia project idea

hiro at torproject.org hiro at torproject.org
Wed Feb 15 12:21:20 UTC 2017


commit 1f5eec9375320854904e1bf8360fac828823605c
Author: Damian Johnson <atagar at torproject.org>
Date:   Tue Jan 31 08:52:35 2017 -0800

    Add Ahmia project idea
    
    GSoC project idea from Juha Nurmi.
---
 getinvolved/en/volunteer.wml | 62 ++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 62 insertions(+)

diff --git a/getinvolved/en/volunteer.wml b/getinvolved/en/volunteer.wml
index 37f6879..64b30b1 100644
--- a/getinvolved/en/volunteer.wml
+++ b/getinvolved/en/volunteer.wml
@@ -1114,6 +1114,68 @@ ideas.
     </p>
     </li>
 
+    <a id="ahmiaSearch"></a>
+    <li>
+    <b>Ahmia - Hidden Service Search</b>
+    <br>
+    Language: <i>Python, Django</i>
+    <br>
+    Likely Mentors: <i>Juha Nurmi (numes), George (asn)</i>
+    <p>
+    Ahmia is open-source search engine software for Tor hidden service deep
+    dark web sites. You can test the running search engine at ahmia.fi. For
+    more information see our <a
+    href="https://blog.torproject.org/category/tags/ahmiafi">blog post about
+    Ahmia's GSoC2014 development</a>.
+    </p>
+
+    <p>
+    Ahmia is a working search engine that indexes, searches, and catalogs
+    content published on Tor Hidden Services. Furthermore, it is an environment
+    to share meaningful insights, statistics, insights, and news about the Tor
+    network itself. In this context, there is a lot of work to do.
+    </p>
+
+    <p>
+    The Ahmia web service is written using the Django web framework. As a
+    result, the server-side language is Python. On the client-side, most of the
+    pages are plain HTML. There are some pages that require JavaScript, but the
+    search itself works without client-side JavaScript.
+    </p>
+
+    <p>
+    There are several possible directions for this project, including...
+    </p>
+
+    <ol>
+      <li>Automate blacklisting (very important)<br />
+        <ul>
+          <li>Fetch a list of child abuse media sites</li>
+          <li>Remove these sites from the search results</li>
+        </ul>
+      </li>
+      <li>Add hidden services funtion (very important)<br />
+        <ul>
+          <li>You can add onions using HTML form</li>
+          <li>Call the crawler immidiately when a new site is added</li>
+        </ul>
+      </li>
+      <li>Elasticsearch<br />
+        <ul>
+          <li>Must be updated to 5.X.X sooner or later</li>
+          <li>Adjust the settings</li>
+          <li>Automatically remove data older than, for instance, 90 days</li>
+        </ul>
+      </li>
+      <li>Maintainance<br />
+        <ul>
+          <li>Update all software dependencies</li>
+          <li>Automate crash recovery for Tor, Elasticsearch and crawler</li>
+        </ul>
+      </li>
+    </ol>
+    </li>
+
 <!--
     <a id=""></a>
     <li>





More information about the tor-commits mailing list