杰雷米查莱特,法国Caluire-et-Cuire的开发者
Jeremie is available for hire
Hire Jeremie

杰雷米查莱特

验证专家  in Engineering

数据科学家, 机器学习 Engineer, and Software Developer

Location
Caluire-et-Cuire、法国
至今成员总数
2021年10月25日

Jeremie is a full-stack 数据科学家 focused on natural language processing and tabular data. 他在IT行业有十年的经验, 为英国政府工作, 大型企业, and startups while taking multiple hats as a back-end developer, DevOps, 数据科学家, 创业公司的技术联合创始人. 他领导了两个ML和心理健康社区, 最后但并非最不重要的是, he is an avid reader and lifelong learner who did many retreats.

Portfolio

Sprout.ai
Jupyter笔记本, Python, Docker, PyTorch, 拥抱的脸, Scikit-learn...
英国创业
Python, PyCharm, Scikit-learn, 亚马逊SageMaker...
数字、文化、媒体和体育部(DCMS)
Python, Pandas, NumPy, Matplotlib, SpaCy, 谷歌BigQuery

Experience

Availability

Part-time

首选的环境

PyCharm, 亚马逊网络服务(AWS), Jupyter笔记本, Slack, Git, Trello, Python, Notion, Linux

最神奇的...

...project I've done is just my last; I built a NLP framework on few-shot learning, 建立主动学习, 并提供了进行偏见分析的工具.

工作经验

自由职业全栈数据科学家

2020年至今
Sprout.ai
  • Ran experiments and built a framework to classify text with tiny labeled datasets; few-shot learning using natural language inference.
  • Provided a series of tools to better understand the company data and models, 比如主题建模, 模型explainability, 偏差分析, 异常值检测.
  • Set up a complete active learning workflow using Label Studio and dedicated back ends.
  • Packaged products such as NLI framework and active learning workflow using Docker containers.
  • Audited the company's ML platform on MLOps and identified next improvements to focus on.
  • Set up a development environment for Kubeflow with Kubernetes and migrated the NLI framework to the Kubeflow pipeline.
  • Conducted research on federated learning and suggested a series of bespoke and open-source solutions.
技术:Jupyter笔记本, Python, Docker, PyTorch, 拥抱的脸, Scikit-learn, 亚马逊网络服务(AWS), Kubeflow, Seaborn, Matplotlib, Plotly, 主题建模, 机器学习, 深度学习, 主动学习, GPT, 生成预训练变压器(GPT), 自然语言处理(NLP), 机器学习操作(MLOps)

全栈数据科学家

2020 - 2021
英国创业
  • Evaluated and compared a series of MLOps platform solutions like AWS SageMaker, Databricks, Kubeflow, and Cnvrg.
  • Designed and proposed multiple service architectures to implement MLOps.
  • 使用DVC设置mlop, MLFlow, 和SageMaker来跟踪实验, 火车模型, save, 然后部署它们.
  • Performed topic modeling and built a series of ensembles on text classification to identify stress and stressors in social media: a multi-modal deep learning classifier combining text and metadata. 这是数据科学训练营的一部分.
  • 生成综合数据来训练和评估模型.
  • Ran profiling to provide 8x improvement in inference time on a classifier.
  • 为Updraft和一家秘密创业公司做自由撰稿人.
技术:Python, PyCharm, Scikit-learn, 亚马逊SageMaker, 机器学习操作(MLOps), 自然语言处理(NLP), GPT, 生成预训练变压器(GPT), Profiling

数据科学家

2020 - 2020
数字、文化、媒体和体育部(DCMS)
  • Performed a literature review of state of the art in job offers classification.
  • 使用spaCy相似性API构建模型, comparing job offer descriptions and titles to UK Standard Industrial Classification (UK SIC) descriptions.
  • Built scripts to run the model on the whole collection (more than one million job offers) and run daily on new job offers.
技术:Python, Pandas, NumPy, Matplotlib, SpaCy, 谷歌BigQuery

AI研究员和高级开发人员

2019 - 2020
国家档案馆
  • Interviewed and assessed five suppliers on their solutions and reports from off-the-shelf record management products to bespoke solutions using fully customized models or AI APIs. 参与第三方技术评估项目.
  • Wrote a 50+ page report on NLP techniques and tools to select for permanent preservation records held by government departments to be shared with multiple audiences, 包括政府决策者, archivists, 数据科学家.
  • Delivered a new release on DROID, an open-source project. 执行或审核60多个拉取请求.
  • 在DROID上管理GitHub社区, 响应用户查询, 审查和合并拉取请求.
  • Increased project transparency and improved project prioritizing.
  • Advocated for the improvement of remote work in my department and offered guidance and support during the COVID-19 lockdown.
技术:Java, Python, 亚马逊SageMaker

首席技术官兼联合创始人

2015 - 2020
Trackener
  • Delivered an IoT product to the market to allow horse owners to look after their horses when they have chronic or acute health issues, 让他们的马保持健康和快乐, 找到内心的平静.
  • 领导我们产品的软件部分, 包括移动, web app, 以及后端开发, 服务器管理, 数据科学.
  • 负责技术团队的项目管理, including two hardware and software engineers and part-time contractors.
技术:PHP, JavaScript, 反应本地, 亚马逊网络服务(AWS), Docker, Java, MongoDB, Azure, MapReduce

高级Java开发人员

2015 - 2016
国家档案馆
  • 与一名研究员和一名数据专家密切合作, 接管了一个连接文件的原型, and designed and 实现 a set of applications to link collections, 评估它们并发布它们.
  • Designed, 实现, and deployed to live and maintained a set of back-end applications dedicated to the categorization of 20+ million records of the national archives for their end-user website Discovery, 使用Lucene, then Solr.
  • 管理运行我的应用程序的服务器.
  • Installed continuous integration platforms such as Jenkins, Nexus, and SonarQube.
  • Created an ML prototype to classify documents based on similarity with Lucene.
  • Ran a series of technical presentations to my department.
Java技术:, Spring Boot, Groovy, MongoDB, Neo4j, JProfiler

软件工程师

2011 - 2014
Atos的《欧博体育app下载》
  • 参与开发, 项目管理, 生产支持 of a mediation platform for a high-visibility project for Orange.
  • Contributed to multiple projects of varying size on the back ends of the leading French telecom company Orange, 包括项目管理和客户关系, 功能和技术设计, 实现, 生产支持.
  • Helped develop a banking application dedicated to mandating management in the SEPA norm for BPCE.
  • 监督一个位于印度的离岸开发团队, 包括支持, validation, 和监控.
Java技术:, Apache Maven, Spring, NGINX, Linux, SQL, MySQL, 面向对象设计(OOD), 软件架构

马匹护理的物联网产品

Trackener is a simple, easy-to-use system that enables owners to remotely monitor their horses 24/7.

我负责我们产品的软件:手机, web app, 以及后端开发, 服务器管理, 数据科学.
2009 - 2011

计算机科学硕士(MSc)学位

斯塔福德郡大学-斯塔福德,英国

2005 - 2010

Master of Engineering (MEng) Degree in General Engineering

Ecole Catholique d'Arts et Métiers de Lyon - Lyon, France

2017年1月至今

Startupbootcamp物联网连接设备

Startupbootcamp

库/ api

Scikit-learn, Pandas, NumPy, Matplotlib, PyTorch, SpaCy, Natural Language Toolkit (NLTK)

Tools

Slack, Git, PyCharm, Trello, YouTrack, G Suite, Amazon弹性容器服务(Amazon ECS), Seaborn, Notion, Gensim, 亚马逊SageMaker, JProfiler, Apache Maven, NGINX, CircleCI, MQTT, Plotly

Languages

Python, Java, HTML, JavaScript, SQL, CSS, PHP, Groovy

Platforms

Jupyter笔记本, Linux, Ubuntu, 亚马逊网络服务(AWS), Docker, Dataiku, Amazon EC2, Kubeflow, Azure

Paradigms

面向服务的架构(SOA), DevOps, 面向对象设计(OOD), MapReduce

Storage

MongoDB, Neo4j, Amazon S3 (AWS S3), MySQL

Frameworks

反应本地, Spring Boot, Spring, Flask

Other

计算机科学, 软件工程, 软件架构, 机器学习, 自然语言处理(NLP), 性能调优, GPT, 生成预训练变压器(GPT), 拥抱的脸, 深度学习, 工程数据, 交互设计(IxD), GloVe, MLflow, 数据版本控制, 谷歌BigQuery, 时间序列分析, 主题建模, 主动学习, 机器学习操作(MLOps), 创业, Profiling, 创业加速器

有效的合作

如何使用Toptal

Toptal matches you directly with global industry experts from our network in hours—not weeks or months.

1

分享你的需求

Discuss your requirements and refine your scope in a call with a Toptal domain expert.
2

选择你的才能

Get a short list of expertly matched talent within 24 hours to review, interview, and choose from.
3

开始你的无风险人才试验

Work with your chosen talent on a trial basis for up to two weeks. 只有当你决定雇佣他们时才付钱.

对顶尖人才的需求很大.

开始招聘