Episode 31: Rethinking Data Science, Machine Learning, and AI
Description
Hugo speaks with Vincent Warmerdam, a senior data professional and machine learning engineer at :probabl, the exclusive brand operator of scikit-learn. Vincent is known for challenging common assumptions and exploring innovative approaches in data science and machine learning.
In this episode, they dive deep into rethinking established methods in data science, machine learning, and AI. We explore Vincent's principled approach to the field, including:
- The critical importance of exposing yourself to real-world problems before applying ML solutions
- Framing problems correctly and understanding the data generating process
- The power of visualization and human intuition in data analysis
- Questioning whether algorithms truly meet the actual problem at hand
- The value of simple, interpretable models and when to consider more complex approaches
- The importance of UI and user experience in data science tools
- Strategies for preventing algorithmic failures by rethinking evaluation metrics and data quality
- The potential and limitations of LLMs in the current data science landscape
- The benefits of open-source collaboration and knowledge sharing in the community
Throughout the conversation, Vincent illustrates these principles with vivid, real-world examples from his extensive experience in the field. They also discuss Vincent's thoughts on the future of data science and his call to action for more knowledge sharing in the community through blogging and open dialogue.
LINKS
- The livestream on YouTube
- Vincent's blog
- CalmCode
- scikit-lego
- Vincent's book Data Science Fiction (WIP)
- The Deon Checklist, an ethics checklist for data scientists
- Of oaths and checklists, by DJ Patil, Hilary Mason and Mike Loukides
- Vincent's Getting Started with NLP and spaCy Course course on Talk Python
- Vincent on twitter
- :probabl. on twitter
- Vincent's PyData Amsterdam Keynote "Natural Intelligence is All You Need [tm]"
- Vincent's PyData Amsterdam 2019 talk: The profession of solving (the wrong problem)
- Vanishing Gradients on Twitter
- Hugo on Twitter
Check out and subcribe to our lu.ma calendar for upcoming livestreams!