Viacheslav Zhenylenko,美国加州旧金山开发人员
Viacheslav is available for hire
Hire Viacheslav

Viacheslav Zhenylenko

Verified Expert  in Engineering

机器学习开发人员

Location
旧金山,加州,美国
Toptal Member Since
June 13, 2019

Viacheslav在数据科学和软件工程方面有7年的经验. 他对从原始数据中获得的见解充满热情,并喜欢将它们转化为创造卓越的商业价值. Viacheslav的主要专长是Python,并具有Java和c++的生产经验. 解决数据量大的项目, 他应用了先进的机器学习技术, 比如计算机视觉, NLP, 产品推荐系统, networking data, 经典数据科学.

Portfolio

Briefly
Django, Web Frameworks, PostgreSQL, SQL...
Grata Inc.
Python, Docker, Elasticsearch, Celery, Kubernetes, Jenkins, PostgreSQL, REST...
Botprise, Inc.
亚马逊网络服务(AWS), Python, Flask, MongoDB, Apache Kafka, React Redux...

Experience

Availability

Part-time

首选的环境

Eclipse, Visual Studio Code (VS Code), PyCharm, Jupyter, MacOS, Linux, Vim Text Editor, Sublime Text, Bash

The most amazing...

...我开发的项目是业界首创的, 用于RAN网络中拥塞检测的自重构AutoML系统.

Work Experience

AI Engineer

2023 - 2023
Briefly
  • 多任务采用OpenAI模型, 包括总结, classification, recommendations, embeddings, agents, and function calling.
  • 使用Terraform开发分布式云基础设施, Amazon弹性容器服务(Amazon ECS), Amazon DynamoDB, Amazon EventBridge, Amazon Simple Queue Service (SQS), AWS Lambda, 和亚马逊简易电子邮件服务(SES).
  • 执行与外部服务和工具的API集成, including OpenAI, Google services, Slack, Notion, and Hubspot.
  • 用业务逻辑设计和构建多个Django api.
Technologies: Django, Web Frameworks, PostgreSQL, SQL, 生成预训练变压器(GPT), GPT, 自然语言处理(NLP), OpenAI Gym, React, Jupyter Notebook, ChatGPT, Automation, OpenAI GPT-3 API, OpenAI GPT-4 API, APIs, HubSpot, Notion, HubSpot CRM, OpenAI, 大型语言模型(llm)

Senior Data Engineer

2022 - 2022
Grata Inc.
  • 在Kubernetes上使用芹菜开发端到端分布式nlp地理编码管道.
  • 从公司网站和聚合器中实现抓取.
  • 在AWS上开发并部署了SageMaker网页分类模型.
  • 实现了一个带有查询查找的混合地理编码模型, query relaxation, result validation, prioritization, 还有后备机制.
  • 使用可用地理数据库和离线实体提取器(如libpostal和第三方地理编码服务)的组合,将其组合在一个视图中.
  • 增加了部分解析地址, 减少了95%的错误地址, 并将整体数据质量得分提高了20%.
Technologies: Python, Docker, Elasticsearch, Celery, Kubernetes, Jenkins, PostgreSQL, REST, Geocoding, Grafana, DataOps, Datadog, Pandas, NumPy, 自然语言工具包(NLTK), GIS, React, Cloud Infrastructure, SQL, Deep Learning, 亚马逊网络服务(AWS), Git, 敏捷软件开发, Linux, Bash, Sublime Text, Vim Text Editor, Classification, Data Pipelines, Django, Jupyter, 自然语言处理(NLP), 生成预训练变压器(GPT), GPT, Parallel Programming, PyCharm, Data Science, Flask, React Redux, Algorithms, Data Analysis, Python 3, SciPy, DevOps, Amazon SageMaker, Data Scraping, Software Development, Programming, Transformers, 并发编程, Data Engineering, Analytics, API Integration, Data Validation, Data Visualization, Dashboards, Databases, Software Engineering, Scikit-learn, Matplotlib, Unit Testing, ETL, GeoPandas, 质量保证(QA), APIs, Jupyter Notebook, Web Frameworks

Lead Data Scientist

2020 - 2022
Botprise, Inc.
  • 在平台上为完整的ML周期、ModelOps和MLOps开发后端. 在AWS SageMaker之上添加了一个包装器.
  • 参与了几十个自动化工作流(用例), including MLOps, analytics, DataOps, networks, ITOps, etc.
  • 使用React创建后端和前端元素进行拖放, chatbot-building应用程序.
  • 实现和部署了数十种算法(分类), clustering, time series, NLP, 和计算机视觉)用于不同的用例.
  • 领导一个小型机器学习团队,包括计划、管理、监控和领导.
技术:亚马逊网络服务(AWS), Python, Flask, MongoDB, Apache Kafka, React Redux, REST, Docker, Kubernetes, Amazon SageMaker, SciPy, NumPy, Pandas, PyTorch, TensorFlow, Transformers, Management, PostgreSQL, 机器学习操作(MLOps), React, Cloud Infrastructure, SQL, Computer Vision, Deep Learning, Keras, Git, 敏捷软件开发, Linux, Object Detection, 卷积神经网络, Bash, Sublime Text, Vim Text Editor, Anomaly Detection, Classification, Data Pipelines, AutoML, TCP/IP, Networks, Time Series Analysis, 并发编程, Predictive Analytics, Jupyter, OpenCV, Predictive Modeling, 机器学习自动化, GPT, 自然语言处理(NLP), 生成预训练变压器(GPT), PyCharm, Data Science, Algorithms, Python 3, AWS CloudFormation, DataOps, Software Development, Networking, AWS Lambda, 人工智能(AI), Programming, Datadog, 自然语言工具包(NLTK), Data Engineering, Analytics, API Integration, Data Validation, Data Visualization, Neural Networks, Dashboards, Databases, Software Engineering, Linear Regression, Microsoft Excel, Spreadsheets, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, Plotly, APIs, Jupyter Notebook, Automation, Web Frameworks

Data Analyst

2020 - 2020
Spin (Tier Mobility)
  • 使用汽车选择和汽车再培训模型,对一家全球电动滑板车租赁公司(数百个城市)的电动滑板车需求进行时间序列预测.
  • 执行临时数据分析并构建Looker仪表板.
  • 进行干预效果分析(促销及其他活动).
技术:数据验证, Data Analysis, Data Analytics, Python, SQL, R, Data Science, Data Visualization, Google Cloud, 谷歌云平台, Looker, BigQuery, Google BigQuery, Pandas, NumPy, Git, 敏捷软件开发, Linux, Cloud Infrastructure, Bash, Sublime Text, Vim Text Editor, Data Pipelines, Time Series Analysis, Predictive Analytics, Jupyter, Predictive Modeling, PyCharm, Apache Airflow, Algorithms, Docker, Python 3, SciPy, Software Development, Programming, Analytics, 商业智能(BI), Dashboards, Databases, Software Engineering, Linear Regression, Spreadsheets, Scikit-learn, Matplotlib, Statistical Modeling, ETL, Plotly, Jupyter Notebook, 物联网(IoT)

高级MLOps工程师

2020 - 2020
职业足球焦点有限责任公司
  • 介绍并实现了MLOps技术、工具和方法.
  • 将一打单片R管道重建为分布式, modular, 以及函数式Python管道.
  • 在Dagster, Seldon, Feast等工具之上开发MLOps层.
  • 为速度和性能对现有模型超参数进行微调.
技术:数据科学, Python, R, Machine Learning, React, Python 3, Seldon, Dagster, RabbitMQ, PostgreSQL, Feast, 机器学习操作(MLOps), Cloud Infrastructure, SQL, Deep Learning, Pandas, NumPy, 亚马逊网络服务(AWS), Git, 敏捷软件开发, Linux, Bash, Sublime Text, Vim Text Editor, Classification, Data Pipelines, 可解释人工智能(XAI), Predictive Analytics, Jupyter, Predictive Modeling, Parallel Programming, PyCharm, Visual Studio Code (VS Code), Data Analytics, Apache Airflow, REST, Algorithms, Kubernetes, Docker, SciPy, Prefect, AWS CloudFormation, DevOps, Software Development, AWS Lambda, Programming, Dask, Data Engineering, Databases, Software Engineering, Linear Regression, Sports, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, Plotly, 质量保证(QA), Jupyter Notebook, Web Frameworks, APIs

机器学习工程师

2020 - 2020
Plutoshift, Inc.
  • 向现有基础设施引入MLOps工具(Seldon、Feast和Great Expectations).
  • 迁移现有的硬编码模型以引入MLOps基础结构.
  • 使用Django为ml相关服务开发后端api.
  • 实现了制造传感器的分类和时间序列预测模型.
技术:机器学习, Python, Django, 卷积神经网络, Object Detection, TensorFlow, PyTorch, Keras, Apache Airflow, Cloud Infrastructure, 谷歌云平台, Azure, Cassandra, Apache Cassandra, Seldon, Feast, SQL, Deep Learning, Pandas, NumPy, Git, 敏捷软件开发, Linux, Google Cloud, Bash, Sublime Text, Vim Text Editor, Anomaly Detection, Classification, Data Pipelines, Time Series Analysis, Predictive Analytics, Jupyter, Predictive Modeling, PyCharm, Data Science, Flask, REST, Algorithms, Kubernetes, Docker, Python 3, SciPy, Software Development, AWS Lambda, Programming, Datadog, Data Engineering, Data Visualization, Databases, Software Engineering, Linear Regression, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, 质量保证(QA), Jupyter Notebook, 物联网(IoT), APIs

Senior AI Developer

2019 - 2020
Akcelita
  • 在AWS上为来自监控摄像头的照片开发了一个摄取和处理管道.
  • 实验了各种非深度学习和深度学习方法并进行了测试. 我还使用并训练了带有注意力机制的暹罗神经网络,达到了95%以上的准确率.
  • 创建并与分析人员共享演示文稿,并为项目开发和维护Wiki.
技术:亚马逊网络服务(AWS), TensorFlow, Python, Docker, PyTorch, AWS Lambda, Computer Vision, Deep Learning, Machine Learning, Classification, 人工智能(AI), Cloud Infrastructure, SQL, Pandas, NumPy, Keras, Git, 敏捷软件开发, Linux, 卷积神经网络, Bash, Sublime Text, Vim Text Editor, Data Pipelines, Predictive Analytics, Jupyter, OpenCV, Predictive Modeling, PyCharm, Data Science, Algorithms, Python 3, SciPy, Software Development, Programming, Data Visualization, Neural Networks, Software Engineering, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, Jupyter Notebook

Team Lead

2018 - 2019
乌克兰国家科学院
  • 领导和指导一个学生团队. 我用敏捷方法定义了目标并控制了过程.
  • 创建了农作物分类和地图创建工具.
  • 手动收集数据,并通过网络抓取使用Mapillary和监督数据标签.
  • 实现并测试了DeblurGAN和其他几个经典的去模糊方法.
  • 通过微调ResNet模型,监督田间定位(YOLO)和作物分类.
Technologies: OpenCV, PyTorch, TensorFlow, Keras, Python, Python 3, Computer Vision, 生成对抗网络(GANs), Cloud Infrastructure, Deep Learning, Pandas, NumPy, 亚马逊网络服务(AWS), Git, 敏捷软件开发, Linux, Object Detection, 卷积神经网络, Bash, Sublime Text, Vim Text Editor, Classification, Data Pipelines, Predictive Analytics, Jupyter, Predictive Modeling, PyCharm, Visual Studio Code (VS Code), Data Science, Algorithms, Docker, SciPy, Beautiful Soup, Web Scraping, Data Scraping, Software Development, AWS Lambda, Programming, GIS, API Integration, Neural Networks, Software Engineering, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, GeoPandas, Jupyter Notebook

Senior Data Scientist

2017 - 2019
Openwave Mobility
  • 创建了一个多阶段的数据管道,从原始数据包数据(TCP/IP层)到机器学习模型的可消费输入,并在Python中实现多处理(CPython).
  • Trained, tuned, evaluated, 并比较了Python (scikit-learn)中的多个机器学习模型, Keras, XGBoost, CatBoost)和c++ (mlpack).
  • 监督数据分析和与利益相关者的沟通. 创建了一个可重用的Python工具,用于生成快速和外部可配置的数据分析报告.
  • 实现了基于专家知识聚合的自定义特征生成算法, derivatives, TCP/IP会话延迟, products, and fractions.
  • 实现了基于模型的自定义多阶段特征选择算法.
  • 在网络中部署和监控生产中的项目. 如果工具检测到拥塞,则应用优化策略. 客户报告说,视频内容的传输质量提高了20%.
Technologies: C++, Python, Python 3, 并发编程, 敏捷软件开发, Machine Learning, Deep Learning, Time Series Analysis, Networks, Networking, TCP/IP, TensorFlow, SciPy, Scikit-learn, Pandas, NumPy, mlpack, AutoML, 可解释人工智能(XAI), Cloud Infrastructure, SQL, 亚马逊网络服务(AWS), Git, Linux, Bash, Sublime Text, Vim Text Editor, Classification, Data Pipelines, Predictive Analytics, Jupyter, Predictive Modeling, 机器学习自动化, Parallel Programming, PyCharm, Data Science, Data Analytics, Algorithms, Docker, Data Analysis, Software Development, AWS Lambda, 人工智能(AI), Programming, Data Engineering, Analytics, Data Validation, Data Visualization, Neural Networks, Databases, Software Engineering, Linear Regression, Matplotlib, Statistical Modeling, Unit Testing, ETL, Plotly, Jupyter Notebook, Automation

Data Scientist

2015 - 2017
Octetis
  • 为在线商店开发、部署并评估了一个Python混合推荐引擎.
  • 监督客户行为分析、可视化和利益相关者沟通.
  • 使用策略模式处理不同的用户粘性场景. 根据受欢迎程度(一般和基于类别)给出上下文建议。, item-to-item, and SVD. (Python, scikit-learn, SciPy).
  • 将推荐引擎集成到Django后端.
  • 随机抽样进行多次A/B测试,对系统进行评估. 与类别基线中最受欢迎的项目进行比较, 我们每回合的购买量提升了150%,收益也有所增加.
  • 用Keras为在线云站点构造器创建了一个图像超分辨率模块.
  • 利用中深度CNN,在多个模糊内核上进行训练,并通过REST将其作为服务部署.
  • 进行的调查显示,该平台用户的满意度提高了约5%.
技术:TensorFlow, Keras, Python, Pandas, NumPy, Python 3, PostgreSQL, 推荐系统, Django, Data Analysis, Data Analytics, Machine Learning, 人工智能(AI), Cloud Infrastructure, MySQL, SQL, Computer Vision, Deep Learning, 亚马逊网络服务(AWS), Git, 敏捷软件开发, Linux, Object Detection, 卷积神经网络, Bash, Sublime Text, Vim Text Editor, Anomaly Detection, Classification, Data Pipelines, Time Series Analysis, Predictive Analytics, Jupyter, OpenCV, Predictive Modeling, PyCharm, Data Science, REST, Algorithms, Docker, SciPy, Jenkins, Microsoft Power BI, Software Development, AWS Lambda, Programming, Data Engineering, Analytics, API Integration, Data Validation, Data Visualization, Neural Networks, 商业智能(BI), Dashboards, Databases, Software Engineering, Linear Regression, Microsoft Excel, Spreadsheets, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, Tableau, Plotly, Jupyter Notebook

软件工程师实习生

2015 - 2015
Facebook
  • 使用FBLearner Flow对AdaBoost客户流失预测模型进行培训和评估.
  • 为优化执行超参数调优.
  • 使用Hive进行数据工程,使用Python处理数据.
技术:Apache Hive, Python, Python 3, Data Pipelines, FBLearner Flow, Machine Learning, Git, 敏捷软件开发, Linux, Bash, Sublime Text, Vim Text Editor, Classification, Predictive Analytics, Predictive Modeling, Data Science, Data Analytics, Algorithms, Software Development, Programming, Data Engineering, Analytics, Databases, Software Engineering, ETL

Research Intern

2014 - 2015
Samsung
  • 开发了智能键盘功能的算法(单词预测和拼写纠正).
  • 开发了n-gram的朴素贝叶斯和k -近邻(KNN)的拼写更正.
  • 使用拉普拉斯平滑和自定义键盘距离为KNN创建了更好的算法性能微调.
  • 用c++开发算法. 将它们与Java和Android键盘集成,并发布到App Store.
Technologies: C++, Android, Java, Machine Learning, 人工智能(AI), 生成预训练变压器(GPT), GPT, 自然语言处理(NLP), Android NDK, Git, 敏捷软件开发, Linux, Bash, Sublime Text, Vim Text Editor, Eclipse, Data Science, Algorithms, Software Development, Programming, Software Engineering

软件工程师实习生

2013 - 2014
Engage Point
  • 在雅加达EE开发了一个内容管理互操作性系统. 我对应用程序使用了模型-视图-控制器框架.
  • 为应用程序的业务逻辑开发了Enterprise JavaBeans.
  • 为表示级别开发JavaServer页面.
技术:Java EE, Java, Linux, Git, Bash, Sublime Text, Vim Text Editor, Eclipse, Software Development, Programming, Software Engineering

网页抓取

这是一个使用Mapillary API在特定位置下载图像的代码示例. 它搜索地图上的特定位置, 确定汽车的角度, 刮擦的只是道路近90度角的侧视图照片.

Languages

Python, SQL, Python 3, Bash, R, Snowflake, Fortran, Java, c++

Frameworks

Django, Flask, Web Frameworks

Libraries/APIs

Scikit-learn, Keras, TensorFlow, PyTorch, Pandas, NumPy, SciPy, Beautiful Soup, Dask, 自然语言工具包(NLTK), Matplotlib, OpenCV, React Redux, PySpark, React

Tools

Jupyter, PyCharm, IPython Notebook, Amazon SageMaker, Geocoding, GIS, Vim Text Editor, Sublime Text, Plotly, Apache Airflow, Git, Celery, Jenkins, AWS CloudFormation, Grafana, Looker, Microsoft Power BI, AutoML, RabbitMQ, BigQuery, Microsoft Excel, Spreadsheets, Tableau, Notion, Android NDK, OpenAI Gym

Paradigms

Data Science, 敏捷软件开发, REST, Unit Testing, ETL, Parallel Programming, DevOps, 并发编程, Anomaly Detection, 商业智能(BI), Automation, Management

Platforms

Docker, AWS Lambda, Jupyter Notebook, MacOS, 亚马逊网络服务(AWS), Linux, Apache Kafka, Kubernetes, Visual Studio Code (VS Code), Eclipse, Android, Java EE, 谷歌云平台, Azure

Storage

MySQL, PostgreSQL, Data Pipelines, Data Validation, MongoDB, Elasticsearch, Datadog, Databases, Apache Hive, Cassandra, Google Cloud

Other

Predictive Analytics, Predictive Modeling, 机器学习自动化, Computer Vision, Data Analytics, Deep Learning, Machine Learning, Mathematics, Applied Mathematics, Statistics, Algorithms, Data Analysis, Dagster, 机器学习操作(MLOps), Prefect, Computer Science, Web Scraping, Data Scraping, Software Development, 人工智能(AI), Programming, 计算科学, Time Series Analysis, 卷积神经网络, Object Detection, Seldon, Feast, Data Visualization, Data Engineering, Analytics, Neural Networks, Software Engineering, Linear Regression, Statistical Modeling, GeoPandas, APIs, 生成预训练变压器(GPT), ChatGPT, OpenAI GPT-3 API, OpenAI GPT-4 API, OpenAI, 推荐系统, 自然语言处理(NLP), Science, Scientific Computing, DataOps, Physical Science, Networking, Physics, Applied Physics, Transformers, TCP/IP, mlpack, 可解释人工智能(XAI), Cloud Infrastructure, Google BigQuery, API Integration, Dashboards, 质量保证(QA), GPT, 物联网(IoT), HubSpot, HubSpot CRM, 生成对抗网络(GANs), Networks, FBLearner Flow, Classification, Apache Cassandra, Sports, 软件架构, 大型语言模型(llm)

2019 - 2021

理论物理硕士(量子场论)

基辅国立大学-基辅,乌克兰

2018 - 2020

计算机数学与代数硕士学位

基辅国立大学-基辅,乌克兰

2010 - 2014

计算机科学与应用统计学学士学位

基辅国立大学-基辅,乌克兰

JULY 2023 - JULY 2026

AWS机器学习专业

Amazon Web Services

2023年1月- 2026年1月

AWS解决方案架构师助理

Amazon Web Services

2022年10月- 2025年10月

AWS认证开发者助理

Amazon Web Services

MAY 2017 - PRESENT

数据科学:从数据到洞察

MITProfessionalX DSx | edX

2017年1月至今

人工智能(AI)

ColumbiaX CSMM.101x | edX

MAY 2011 - PRESENT

Second Place

国际反腐败委员会,国家一级

JULY 2010 - PRESENT

Bronze Medal

国际数学奥林匹克(IMO)

MAY 2009 - PRESENT

Second Place

基辅国际物理节