This new meet-up is focused on Generative AI such as LLMs and diffusion models.
Expect to see product demos, papers discussed, emerging business models analysed, best practices, infrastructure, tooling etc. These will generally be discussed along technical, social, economical, philosophical themes. The format is simple: some talks, chats, pizza, refreshments.
How to train an LLM for free
Fionnán Alt — Founder, MainsailAI
Large Language Models (LLMs) present a step-change in technology capability as well as profound and wide-ranging societal, legal, ethical, philosophical and economic impacts. In addition, due to their size and complexity, building or training an LLM from scratch is often viewed as being prohibitively expensive outside of large tech companies or well-funded AI labs.While state-of-the-art models such as OpenAI's GPT-4 are estimated to cost $100m+ for a single training run, recent advances have reduced the overhead required for running and training LLMs.
In this talk, Fionnán demonstrates how to train a capable LLM from scratch using free resources kindly provided by the TPU Research Cloud. We will explore using modern ML frameworks and technologies such as JAX, Ray, and Tensor Processing Units (TPUs) to efficiently scale out neural net training tasks for models with billions of parameters.
Fionnán is a data science leader who has been helping high-performing AI teams scale companies for the past decade. He is also an advisor to a number of startups, focusing on AI strategy, and provides consulting in the LLM space with MainsailAI.
Uncovering granular insights about your LLMs
Morgan McGuire — Head of Growth ML, Weights & Biases.
Weights & Biases new LLM debugging tool, with Trace Timeline and Trace Table, is a natural extension of its scalable experiment tracking, designed to support ML practitioners working on prompt engineering for LLMs. It makes it easy review past results, identify and debug errors, gather insights about model behavior, and share learnings with colleagues.
In this talk, Morgan will demonstrate how to introduce a streamlined debugging workflow into your process, with some practical examples to get you started.
Morgan leads the Growth ML team at Weights & Biases. Previously he worked on data and ML at Meta and prior to that as an analyst in two electricity trading houses.
Name classification: How does ChatGPT compare to machine learning language models?
Pedro Tabacof — Staff ML Scientist, Intercom.
In this presentation, Pedro shows how to use machine learning models and ChatGPT to classify names into person or company (e.g. "Google" is a company while "Barack Obama" is a person). The task of name classification is just an illustrative example of text classification. The target audience includes data scientists interested in applied natural language processing (NLP) and/or how to use ChatGPT in practice.
First, we present the Kaggle datasets used and how we can start with a simple baseline using scikit-learn logistic regression. Then, we show how to use two more complex models for the same task: FastAI LSTM and Hugging Face DistilBERT pre-trained language models. Finally, we show how we can leverage ChatGPT for its predictive capabilities: This only requires calling its API, a fallback method when it refuses to classify a name, and some textual post-processing to extract the answers. As simple as it sounds, this achieves higher accuracy than the previous models without any training whatsoever. The whole presentation can be reproduced from a publicly available Jupyter notebook.
Pedro Tabacof a staff machine learning scientist at Intercom in Dublin. Previously, he worked as a data scientist at Wildlife Studios, Nubank, and iFood. Academically, he has a Master's degree in deep learning and 300+ citations.
Intercom 2nd Floor, Stephen Court, 18-21 St Stephen's Green, Dublin 2, D02 N960
- 18:00 - Arrival & check-in*
- 18:25 - Welcome & kick off
- 18:30 - First Talk, followed by Q&A
- 19:00 - Second Talk, followed by Q&A
- 19:30 - Break / Pizza / Refreshments
- 19:50 - Third Talk, followed by Q&A
- 20:20 - Wrap up & "thank yous"
* Please note: Intercom operate a check-in for all guests, we will provide additional details the day before this event takes place.
Eamon Leonard
A veteran of 20 years developing and building products, leading teams, and creating communities. Partner at Broadstone, a pre-seed fund. eamon@broadstone.vc
Fionnán Alt
A seasoned data science leader, and technical advisor to a number of startups focusing on AI strategy. Founder MainsailAI fionnan@mainsail.ai