Improve Apache Spark aggregate performance with batching

Improve Apache Spark aggregate performance with batching

Seahorse provides users with reports on their data at every step in the workflow. A user can view reports after each operation to review the intermediate results. In our reports we provide users with distributions for columns in the form of a histogram for continuous data, and a pie chart for categorical data.

AAIA'16 Data Mining Challenge Winning

AAIA’16 Data Mining Challenge Winning

deepsense.io tops global competition for predicting dangerous seismic events in active coal mines.