Tensor-based Approach to Big Data Processing and Machine Learning
- Prelegent(ci)
- Maciej Bartoszuk
- Afiliacja
- QED Software
- Termin
- 25 listopada 2022 16:15
- Informacje na temat wydarzenia
- 4060 i online meet.google.com/jbj-tdsr-aop
- Seminarium
- Seminarium badawcze „Systemy Inteligentne”
We present an approach to tensor compression, decomposition and processing algorithms on the top of them. Our implementation uses the popular scalable data processing framework Apache Parquet to effectively store data. This library does not directly store tensors as native data types, but we slightly changed the implementation for our purpose using its specific data storage format and extending it with additional compression. We summarize the performance of tensor storage as well as the effectiveness of multiple machine learning methods and their hyperparameter tuning.
---------------------------------------
Plan wystąpień w tej edycji jest dostępny tutaj
The schedule of presentations can be checked here