
Daniil Terekhin
About
Data Scientist | Machine Learning Engineer | Computer Vision Specialist | MSc in Artificial Intelligence | 2+ Years of IT Consulting & Startup Experience
I have two years of hands-on experience as a Data Scientist in IT consulting and experience as a Project Manager and Machine Learning Engineer in startup ecosystems. My technical expertise includes working extensively with the PyTorch framework on computer vision projects—my thesis focused on "Automatic Segmentation of Asbestos Fibers in Low-Magnification Electron Microscopy Images," applying image pre-processing (OpenCV), augmentation, and cropping. Additionally, I have implemented for a Swiss startup an LLM chatbot with Retrieval-Augmented Generation (RAG) using vector database FAISS, enabling context-aware responses through vector-based information retrieval. I have also used C++ for HPC, and I have several NLP projects, where I have trained from scratch and used pretrained models for making chatbot game with voice interaction. In my previous Data Scientist role, I handled real-time big data analysis using PySpark, developed AIRFLOW scripts for automated data uploads, and created insightful monthly reports with PowerBI. I collaborated with both clients and internal teams, applying Machine Learning to optimize data-driven strategies across various projects. I implemented state-of-the-art machine learning approaches to solve business problems, including anomaly detection for a major bank, A/B testing for a recommendation system, residential real estate project.
Skills
Airflow
Computer Vision
Data Science
Data Visualization
Deep Learning
Hadoop
Machine Learning
MySQL
PostgreSQL
Python
Open for
fulltime
Work Experience
Norbit
2021-01 - 2022-05
Data Scientist
Moscow
fulltime
This company is a leading expert in the IT consulting market and the development and implementation of effective solutions for business automation. Big Data Analysis and Reporting: – Conducted real-time big data analysis using PySpark to process streaming data from client websites, focusing on user activity and purchase behavior. – Developed AIRFLOW scripts for nightly data uploads, ensuring consistent data availability. – Generated monthly trend reports and vital statistics for clients using PowerBI, highlighting key performance indicators and actionable insights. Client Communication and Data-Driven Strategy: – Collaborated closely with clients and team members to identify project objectives and essential predictive data requirements. – Regularly collected, analyzed, and presented data findings using PySpark and Pandas, maintaining transparent communication about progress and potential challenges. – Explored and tested various hypotheses based on the data using Machine Learning approaches, contributing to the development of informed, data-driven strategies
Black VR
2019-07 - 2020-05
Project Manager
Moscow
fulltime
BlackVR Is an IT company for the development of AR and VR technology, 360 video and websites production. My main tasks were - data analysis with excel and R, - leading projects and the development team, - helping to organize and conduct events involving with virtual reality helmets use and managing with full logistics for such events.
NavAI
2024-09 - 2024-12
Machine learning engineer intern
Lugano
internship
- Developed and Deployed an Intelligent Chatbot: Created a chatbot using Flask, later migrating to Socket.IO for real-time communication, enhancing user engagement and support. - Implemented Semantic Search with Vector Embeddings: Utilized vector databases (e.g., Chroma) to store embeddings, enabling efficient semantic retrieval of information and improving response relevance (RAG). - Advanced Prompt Engineering: Fine-tuned language models and crafted prompts to optimize the chatbot's conversational abilities and accuracy in understanding user queries. - Natural Language Processing Optimization: Developed NLP pipelines for text preprocessing, including tokenization and language detection, to handle multilingual inputs effectively. - Production Deployment and Scalability: Deployed the chatbot in a scalable environment, ensuring high performance and the ability to handle increased user traffic.
Academic Experience
National Research University Higher School of Economics -
2017.08 - 2021.07
Bachelor of Science, BSc in Business-Informatics
Università della Svizzera italiana -
2022.09 - 2024.09
Master of Science, MSc in Artificial Intelligence