To achieve full understanding of the use and application of ML algorithms, our participants will work on a real-life industry project, translating theoretical knowledge to practical process and overcoming realistic challenges.
Scope:~400 work hours total
Data:Real data provided by company
Guidance:Experienced mentors provided by Y-DATA
Support:Weekly meetings with company data-owner
FundboxSegmentation of users based on clickstream events
Fundbox provides credit solutions for various types of users and industries. In order to do so, Fundbox needs to characterize customers using all available datThe aim of this project is to identify different types of users based on their behavior on Fundbox during the online registration flow, as described by a stream of their click events on the site, predict which users will complete it successfully and locate potential issues preventing successful registration.
Full project cycle
The process of working on the project follows popular industry standards and methodologies and incorporates a growing set of tools the students possess to methodically understand and solve a real-world problem. Our students have a full-cycle data science project in their portfolio upon graduation, covering all industry-standard stages: Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation.
Example ProjectAutomatic detection of low-value queries in technical Q&A forum
A customer operates a forum where programmers ask each other questions, provide answers and rate questions giving them \"ups\" and \"downs\". The forum has a core expert community that provides good answers and valuable insights. However, they often waste their time handling questions of little to no value: marking questions as duplicates and redirecting them, closing topics with incoherent or irrelevant questions etc. Because of this, the overall efficiency of the system suffers.