[metrics-bugs] #25644 [Metrics/CollecTor]: Write white paper about CollecTor's data processing (Sponsor13, 1)

Tor Bug Tracker & Wiki blackhole at torproject.org
Thu Mar 29 20:49:54 UTC 2018


#25644: Write white paper about CollecTor's data processing (Sponsor13, 1)
-------------------------------+--------------------------
 Reporter:  iwakeh             |          Owner:  iwakeh
     Type:  task               |         Status:  accepted
 Priority:  Medium             |      Milestone:
Component:  Metrics/CollecTor  |        Version:
 Severity:  Normal             |     Resolution:
 Keywords:                     |  Actual Points:
Parent ID:                     |         Points:
 Reviewer:                     |        Sponsor:
-------------------------------+--------------------------

Comment (by iwakeh):

 In 2013 JSR 352 Batch Applications for the Java Platform was finalized.
 As the main implementations are Java EE 7 and Spring Batch these two
 should be covered by this activity.  Other suitable frameworks can be
 found in streaming and data processing fields.  These focus usually on
 real-time processing, which is not CollecTor's concern, but also provide
 solutions for the main batch processing tasks: retrieve from a source,
 process, and write the data.  Thus, we should also take a look at
 [https://flink.apache.org/ Apache's Flink] streaming framework that
 explicitly features its own Batch DataSet API.  Flink is also well
 integrated into Apache's Java tooling/framework environment.

 Thus, the list of batch frameworks we evaluate is Java EE and Spring (as
 JSR 352 implementations) and Flink.

--
Ticket URL: <https://trac.torproject.org/projects/tor/ticket/25644#comment:3>
Tor Bug Tracker & Wiki <https://trac.torproject.org/>
The Tor Project: anonymity online


More information about the metrics-bugs mailing list