Data Engineer & Big Data Specialist
I am a passionate Data Engineer with experience in designing and implementing data pipelines, developing ETL processes, and building data warehouse architectures. I work with modern technologies that keep data processing reliable and efficient.
I hold a Bachelor's degree in Applied Mathematics and Informatics and currently work as a Data Analyst at Lime HD. In this role I go beyond the standard responsibilities of a data analyst and take on a wide range of tasks characteristic of a Data Engineer.
With over a year of experience at an accredited IT company, I am constantly looking for ways to improve processes, automate workflows, and adopt new technologies that enhance efficiency. Working with cross-functional teams, I provide analytical insights that support data-driven decisions and contribute to the growth and success of the organization.
I strive to deepen my expertise in data analysis and engineering, explore new technologies and approaches, and maximize my contribution to reliable and scalable solutions. To my mind, a modern Data Engineer is a technical specialist responsible for the full data lifecycle: from collection and processing to providing analytical tools. The role requires knowledge of programming, big data, infrastructure management, and data quality assurance. This is the direction I aim to grow in, building efficient, reliable, and scalable solutions that deliver real value to the business.
My experience includes:
Data Cleaning & Processing; ETL; Automation; API Development & Integration (building RESTful APIs with FastAPI and integrating third-party APIs with requests and aiohttp); Modeling (predictive and descriptive models); Statistical Analysis; Scripting (writing efficient scripts for data manipulation and task automation)
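A minimal sketch of this kind of API work: a FastAPI endpoint that wraps a third-party service with aiohttp. The upstream URL, route, and response shape below are hypothetical placeholders, not a real service.

```python
# Sketch: a FastAPI endpoint that proxies a third-party API via aiohttp.
# The upstream URL and field names are hypothetical placeholders.
import aiohttp
from fastapi import FastAPI, HTTPException

app = FastAPI()

THIRD_PARTY_URL = "https://api.example.com/metrics"  # hypothetical endpoint

@app.get("/metrics/{source_id}")
async def get_metrics(source_id: str):
    # Fetch data from the upstream API asynchronously.
    async with aiohttp.ClientSession() as session:
        async with session.get(f"{THIRD_PARTY_URL}/{source_id}") as resp:
            if resp.status != 200:
                raise HTTPException(status_code=502, detail="upstream error")
            payload = await resp.json()
    # Return a trimmed, stable response shape to API consumers.
    return {"source": source_id, "data": payload}
```

Served locally with `uvicorn app_module:app --reload`.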
Data Pipelines: Building and optimizing ETL/ELT pipelines
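As a sketch of a typical pipeline shape, assuming a hypothetical HTTP source, made-up column names, and placeholder PostgreSQL credentials (loading via SQLAlchemy requires a driver such as psycopg2):

```python
# Minimal ETL sketch: extract from an HTTP API, transform with pandas,
# load into PostgreSQL. All names and credentials are placeholders.
import pandas as pd
import requests
from sqlalchemy import create_engine

def extract(url: str) -> pd.DataFrame:
    resp = requests.get(url, timeout=30)
    resp.raise_for_status()
    # Assumes the API returns a JSON list of records.
    return pd.DataFrame(resp.json())

def transform(df: pd.DataFrame) -> pd.DataFrame:
    # Deduplicate and normalize timestamps before loading.
    df = df.drop_duplicates(subset=["id"])
    df["created_at"] = pd.to_datetime(df["created_at"], utc=True)
    return df

def load(df: pd.DataFrame, table: str) -> None:
    engine = create_engine("postgresql://user:pass@localhost:5432/dwh")
    df.to_sql(table, engine, if_exists="append", index=False)

if __name__ == "__main__":
    load(transform(extract("https://api.example.com/events")), "events_raw")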
Data Warehousing: Designing scalable architectures that prepare data for reporting and business analysis to support decision-making across the organization
Creating detailed documentation for data solutions, standardizing workflows, and ensuring maintainable processes; versioning and managing documentation using tools like Confluence, Notion, or Git
Designing interactive and real-time dashboards using Grafana, Yandex DataLens, ReDash, etc.; Creating insightful and customizable visualizations (Matplotlib, Plotly, Seaborn, Highcharts JS); Reporting Tools (Generating automated and presentation-ready reports with Pandas Styler, openpyxl, ReportLab)
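A small illustration of the visualization side, using Plotly Express on synthetic data; the metric and output file name are made up for the example.

```python
# Sketch: a shareable, presentation-ready chart with Plotly Express.
import pandas as pd
import plotly.express as px

df = pd.DataFrame({
    "date": pd.date_range("2024-01-01", periods=30, freq="D"),
    "active_users": range(100, 130),  # synthetic metric values
})
fig = px.line(df, x="date", y="active_users", title="Daily Active Users")
fig.update_layout(template="plotly_white")
fig.write_html("dau_report.html")  # self-contained HTML report
```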
Relational & Analytics Databases: MySQL, Postgres (Query Optimization, ProxySQL, Backup & Restore); ClickHouse (Columnar Storage, High-Speed OLAP Queries, Distributed Clusters, Real-Time Analytics); NoSQL: Redis (In-Memory Key-Value Store)
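For example, a minimal sketch of an OLAP aggregation against ClickHouse from Python, assuming the clickhouse-driver package and a hypothetical events table:

```python
# Sketch: a week-over-day event count on ClickHouse.
# The `events` table and its `ts` column are hypothetical.
from clickhouse_driver import Client

client = Client(host="localhost")

rows = client.execute(
    """
    SELECT toDate(ts) AS day, count() AS events
    FROM events
    WHERE ts >= now() - INTERVAL 7 DAY
    GROUP BY day
    ORDER BY day
    """
)
for day, events in rows:
    print(day, events)
```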
Supervised & Unsupervised Learning, Deep Learning (TensorFlow, Keras), Hyperparameter Tuning, Feature Engineering (Extracting and transforming features), Time-Series Analysis, Ensemble Methods (Combining models with stacking, bagging, and boosting techniques for robust predictions), Model Deployment
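A compact sketch of the ensemble idea: stacking two scikit-learn base models under a logistic-regression meta-model, evaluated on synthetic data.

```python
# Sketch: a stacking ensemble with scikit-learn on synthetic data.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=1000, n_features=20, random_state=42)

stack = StackingClassifier(
    estimators=[
        ("rf", RandomForestClassifier(n_estimators=200, random_state=42)),
        ("lr", LogisticRegression(max_iter=1000)),
    ],
    final_estimator=LogisticRegression(),  # meta-model combines base predictions
)
print(cross_val_score(stack, X, y, cv=5).mean())
```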
Containerization & Orchestration (Docker, docker-compose for building, deploying, and managing containerized applications at scale), CI/CD Integration, Monitoring (Zabbix, Prometheus, Grafana), Version Control (GitLab)
Service Management (systemd), Logs & Diagnostics (journalctl), User Management, Permissions, Process Monitoring (top, htop, ps), Scheduling Tasks (cron, systemd timer), Networking (iptables, ip, netstat), File System Management
Workflow Scheduling & Orchestration (Designing, scheduling, and orchestrating complex data workflows), DAG Management (Building, maintaining, and optimizing Directed Acyclic Graphs (DAGs) for efficient workflow execution), Deploying and configuring Airflow for distributed execution with Celery
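A minimal DAG sketch, assuming Airflow 2.4+; the dag_id, schedule, and task bodies are illustrative placeholders rather than a production workflow.

```python
# Sketch: a small daily DAG with two ordered tasks.
from datetime import datetime

from airflow import DAG
from airflow.operators.python import PythonOperator

def extract():
    print("pulling data from source systems")  # placeholder task body

def load():
    print("loading transformed data into the warehouse")  # placeholder

with DAG(
    dag_id="daily_events_etl",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    extract_task = PythonOperator(task_id="extract", python_callable=extract)
    load_task = PythonOperator(task_id="load", python_callable=load)
    extract_task >> load_task  # edge: load runs only after extract succeeds
```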
Effectively communicating with analysts to translate business requirements into technical solutions; Collaborating with database administrators to optimize database performance and ensure data integrity; Working with DevOps teams to maintain data infrastructure stability; Partnering with backend developers to design scalable solutions and integrate APIs for seamless data flow; Facilitating cross-team collaboration to align on goals, streamline workflows, and deliver results
ELK Stack: ElasticSearch (Full-Text Search), Logstash (data ingestion pipelines, parsing logs), Kibana (Data Visualization & Dashboards); Zabbix: Monitoring System Performance, Metrics Collection; Prometheus: Collecting and querying metrics using PromQL for application and infrastructure performance monitoring; Grafana: Configuring alerts and thresholds for proactive monitoring
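A small sketch of how a job can expose custom metrics for Prometheus to scrape, using the official prometheus_client package; the metric names and update loop are illustrative.

```python
# Sketch: exposing custom ETL metrics on :8000/metrics for Prometheus.
import random
import time

from prometheus_client import Counter, Gauge, start_http_server

ROWS_PROCESSED = Counter("etl_rows_processed_total", "Rows processed by the ETL job")
LAG_SECONDS = Gauge("etl_lag_seconds", "Seconds since the last successful load")

if __name__ == "__main__":
    start_http_server(8000)  # metrics endpoint for the Prometheus scraper
    while True:
        ROWS_PROCESSED.inc(random.randint(50, 150))  # simulated throughput
        LAG_SECONDS.set(random.uniform(0, 30))       # simulated freshness lag
        time.sleep(5)
```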
• Designed and implemented full-scale ETL/ELT pipelines for data integration and transformation from various sources, including APIs, databases, and web scraping.
• Built and maintained infrastructure for data processing and monitoring using tools such as Grafana, Prometheus, and ClickHouse.
• Automated data workflows with scripts (Python, Bash) and Docker containers, improving efficiency and reliability of daily operations.
• Developed dashboards to visualize critical technical and business metrics, enabling informed decision-making.
• Conducted exploratory data analysis (EDA) to identify key trends and insights, supporting various business needs.
• Collaborated with cross-functional teams to integrate and standardize data from multiple sources for unified reporting.
• Provided actionable analytics to support decision-making and process optimization within the analytics department.
Chuvash State University (ChuvSU) named after I. N. Ulyanov | 2023 - 2025
Chuvash State University (ChuvSU) named after I. N. Ulyanov | 2019 - 2023
View Certificate | Awarded on: 31 Aug 2022
International Student Scientific Conference on Technical, Humanitarian, and Natural Sciences
Presented the paper "Modeling Short-Term Inflation Forecasts" in the section "Mathematical Models in Economics and Numerical Analysis" | 2023
View Certificate | 2022
Interested in discussing a project, collaborating, or hiring me?
Reach out via phone, email, or LinkedIn: