deepsense.io premiere: New Polish project to be presented to the world in June in the U.S.A.

deepsense.io premiere: New Polish project to be presented to the world in June in the U.S.A.

deepsense.io is a startup supported by CodiLime, a growing Polish software company. The main project being developed by the best programmers Poland has to offer is DS Studio, an innovative platform for complex Big Data analysis using machine learning solutions. This is precisely what will be presented at two large IT conferences taking place in June in California: Hadoop Summit and Spark Summit.

Is there a limit to athletes' abilities in athletic running events?

Is there a limit to athletes’ abilities in athletic running events?

I’m forming a group of students and grads interested in data analysis: The MI^2 (square) Group. The name comes from the fact that we have students from MIM University of Warsaw and MINI Warsaw Technical University. We are playing with different projects and ideas…

Mice, post hoc tests and diffograms

Mice, post hoc tests and diffograms

I’ve recently worked on an interesting problem. There are two types of mice. We select three animals of each type. We want to examine an effect of given treatment on nerve cells, more specific: on their dendritic spines (small protrusions located at neurons). From…

The marathon of teams’ data analysis - wrap-up

The marathon of teams’ data analysis – wrap-up

The first team’s data analysis marathon took place on last Saturday. Almost 60 participants turned up to take part in it (representing various levels of proficiency in the art of data analysis and different regions of Poland –most were from Warsaw but there were…

The marathon of teams’ data analysis

In just four days’ time we are going to start a marathon of teams’ data analysis. This time it’s a local Warsaw event, but next time? It’s up to us! Let us sum up what we know about that event.

Colors of cars

Colors of cars

Last week we tried to find out what is the color of the cars with the highest engine power. It turned out that black and black metallic are most popular colors of the fastest cars. Yet engine power is not all. We still may explore the relation between color and brand.

What color car is the fastest?

What color car is the fastest?

RECOMB 2015, a conference devoted to computational molecular biology (with emphasis on computational), came to an end yesterday. Many interesting papers were presented, yet this post was inspired by a conversation that I had the pleasure to have during dinner break…

IMDB + ggvis, a happy couple

IMDB + ggvis, a happy couple

Two weeks ago we showed how to scrap data from IMDB database with the use of rvest package. Last week we showed a shiny application, that compares ratings from two selected groups of users. Today we are going to finish the IMDB trilogy. This time I am going to show how to create an ggvis plot based on IMDB data.

You should not watch these movies with your wife / girl

You should not watch these movies with your wife / girl

Last week’s post showed how to download data on ratings of over 200 television series. The rating was broken down by gender and age of the user. The application presented below allows for selection of any two age/gender groups of users and comparison of their ratings…

R, rvest and web-harvesting

R, rvest and web-harvesting

Data harvested from the web pages is a source of interesting information. Pulling data used to require quite a lot of resilience and misshapen Perl scripts struggling with messy sources of web pages. Today’s web pages more and more frequently comply meet the standards. There are also more and more civilized tools for parsing websites.