Joinup

About

Data Scientist | Machine Learning Engineer | Computer Vision Specialist | MSc in Artificial Intelligence | 2+ Years of IT Consulting & Startup Experience

I have two years of hands-on experience as a Data Scientist in IT consulting and experience as a Project Manager and Machine Learning Engineer in startup ecosystems. My technical expertise includes working extensively with the PyTorch framework on computer vision projects—my thesis focused on "Automatic Segmentation of Asbestos Fibers in Low-Magnification Electron Microscopy Images," applying image pre-processing (OpenCV), augmentation, and cropping. 

Additionally, I have implemented for a Swiss startup an LLM chatbot with Retrieval-Augmented Generation (RAG) using vector database FAISS, enabling context-aware responses through vector-based information retrieval. I have also used C++ for HPC, and I have several NLP projects, where I have trained from scratch and used pretrained models for making chatbot game with voice interaction.

In my previous Data Scientist role, I handled real-time big data analysis using PySpark, developed AIRFLOW scripts for automated data uploads, and created insightful monthly reports with PowerBI. I collaborated with both clients and internal teams, applying Machine Learning to optimize data-driven strategies across various projects. I implemented state-of-the-art machine learning approaches to solve business problems, including anomaly detection for a major bank, A/B testing for a recommendation system, residential real estate project.

Skills

Airflow

Computer Vision

Data Science

Data Visualization

Deep Learning

Hadoop

Machine Learning

MySQL

PostgreSQL

Python

Open for

fulltime

Work Experience

Norbit

2021-01 - 2022-05

Workplace
Data Scientist
Location

Moscow

Employement type

fulltime

This company is a leading expert in the IT consulting market and the development and implementation of effective solutions for business automation. Big Data Analysis and Reporting: – Conducted real-time big data analysis using PySpark to process streaming data from client websites, focusing on user activity and purchase behavior. – Developed AIRFLOW scripts for nightly data uploads, ensuring consistent data availability. – Generated monthly trend reports and vital statistics for clients using PowerBI, highlighting key performance indicators and actionable insights. Client Communication and Data-Driven Strategy: – Collaborated closely with clients and team members to identify project objectives and essential predictive data requirements. – Regularly collected, analyzed, and presented data findings using PySpark and Pandas, maintaining transparent communication about progress and potential challenges. – Explored and tested various hypotheses based on the data using Machine Learning approaches, contributing to the development of informed, data-driven strategies

Black VR

2019-07 - 2020-05

Workplace
Project Manager
Location

Moscow

Employement type

fulltime

BlackVR Is an IT company for the development of AR and VR technology, 360 video and websites production. My main tasks were - data analysis with excel and R, - leading projects and the development team, - helping to organize and conduct events involving with virtual reality helmets use and managing with full logistics for such events.

NavAI

2024-09 - 2024-12

Workplace
Machine learning engineer intern
Location

Lugano

Employement type

internship

- Developed and Deployed an Intelligent Chatbot: Created a chatbot using Flask, later migrating to Socket.IO for real-time communication, enhancing user engagement and support. - Implemented Semantic Search with Vector Embeddings: Utilized vector databases (e.g., Chroma) to store embeddings, enabling efficient semantic retrieval of information and improving response relevance (RAG). - Advanced Prompt Engineering: Fine-tuned language models and crafted prompts to optimize the chatbot's conversational abilities and accuracy in understanding user queries. - Natural Language Processing Optimization: Developed NLP pipelines for text preprocessing, including tokenization and language detection, to handle multilingual inputs effectively. - Production Deployment and Scalability: Deployed the chatbot in a scalable environment, ensuring high performance and the ability to handle increased user traffic.

Academic Experience

National Research University Higher School of Economics -

 

2017.08 - 2021.07

Bachelor of Science, BSc in Business-Informatics

Università della Svizzera italiana -

 

2022.09 - 2024.09

Master of Science, MSc in Artificial Intelligence