A library for summarizing data in streams for which it is infeasible to store all events.
Apache Parquet Column (Incubating)

The DSI utilities are a mishmash of classes accumulated during the last ten years in projects developed at the DSI (Dipartimento di Scienze dell'Informazione, i.e., Information Sciences Department), now DI (Dipartimento di Informatica, i.e., Informatics Department), of the Università degli Studi di Milano.
Fast and robust NLP components implemented in Java.
High-performance scientific and technical computing data structures and methods, mostly based on CERN's Colt Java API.