ML Workshop at Codiax: BERT, Gradient Boosting and Journalism

I held a workshop today at Codiax 2019, titled "Matching Journalists with Domain Experts: Text Classification with BERT and Gradient Boosting Trees from Idea to Production".

Abstract: We’ll start with a real-world challenge: matching journalists interested in writing a story with relevant domain experts. We’ll examine the business use case, convert it into a technical one and discuss a baseline approach. We’ll then work on implementing a better and smarter solution, using a combination of machine learning algorithms. We’ll discuss performance, pros and cons, as well as future steps.
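
To give a rough idea of what a baseline for this kind of matching could look like, here is a minimal sketch. It is not taken from the workshop materials: it assumes the problem is framed as ranking experts by bag-of-words cosine similarity between a journalist's request and each expert's profile text. The expert names, profile texts and function names are all hypothetical, made up for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_experts(query, experts):
    """Return (name, score) pairs sorted from best to worst match."""
    query_vec = Counter(tokenize(query))
    scores = [
        (name, cosine(query_vec, Counter(tokenize(profile))))
        for name, profile in experts.items()
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical expert profiles, invented for illustration only.
EXPERTS = {
    "climate_scientist": "climate change emissions carbon policy weather",
    "cardiologist": "heart disease blood pressure cardiology patients",
    "economist": "inflation interest rates markets economic policy",
}

ranking = rank_experts("story about rising interest rates and inflation", EXPERTS)
print(ranking[0][0])  # name of the best-matching expert
```

A word-overlap baseline like this is cheap and easy to inspect, but it cannot tell that "prices going up" and "inflation" mean the same thing, which is the kind of gap a learned model such as BERT is meant to close.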

I’ve added the slides (written in a Jupyter notebook using the RISE extension), along with the support files, on GitHub: https://github.com/andreiolariu/codiax_workshop