Больше информации по резюме будет доступно после регистрации

Зарегистрироваться
Был сегодня в 17:53

Мужчина, 29 лет, родился 14 мая 1996

Тбилиси, готов к переезду (Другие регионы, Россия), готов к командировкам

Senior Data Engineer (Eng)

Специализации:
  • Программист, разработчик

Тип занятости: полная занятость

Опыт работы 8 лет 4 месяца

Февраль 2022по настоящее время
4 года 3 месяца
Nitka Technologies
Senior Big Data Engineer
- Designed, developed, and optimized ETL pipelines to extract, transform, and load raw data from diverse sources into business-level aggregates, supporting a large-scale data ecosystem with 2,900 tables and 141 TB of data. (Python, PySpark, Azure Databricks, Kafka, AWS S3, Google BigQuery, MySQL, Snowflake, YAML) - Engineered PII data processing services compliant with CCPA, GDPR, and CPRA, ensuring data security and privacy for 1,000 – 100,000 daily requests. (Python, PySpark, Azure Databricks, MySQL, Airflow) - Developed custom Apache Airflow operators to automate the migration of ETL workflows from Rundeck to Airflow, significantly reducing manual efforts and accelerating data pipeline deployment. (Python, Airflow, YAML, CI/CD) - Created cost-optimization pipelines and dynamic dashboards to identify and monitor cost-saving opportunities, reducing platform operational expenses by 12%. (Python, PySpark, Azure Databricks, MySQL, Data Visualization) - Built a scalable data ingestion framework leveraging 3rd-party APIs, ensuring robust data availability and expanding platform capabilities for new business requirements. (Python, REST APIs, Airflow, Data Orchestration) - Implemented CI/CD pipelines using GitLab CI/CD for automated deployment, enhancing service stability and secure infrastructure management. (Python, GitLab CI/CD, DevOps)
Январь 2018Июль 2022
4 года 7 месяцев
Объединенный институт ядерных исследований

Дубна (Московская область)

Data Engineer
- Designed and implemented scalable streaming and batch ETL pipelines for data extraction from public sources, enabling daily ingestion of tens of gigabytes of raw data for analytical processing. (Python, PySpark, NoSQL) - Developed high-performance, parallel data parsers to bypass server restrictions, boosting data retrieval speed by 10x. (Python, Airflow, Web Scraping, REST API) - Applied machine learning algorithms and neural networks for NLP tasks, including text classification and clustering, improving data quality for advanced analytics. (Python, PySpark, Scikit-learn, Keras, NLP, Data Science) - Engineered back-end APIs and data services for a web platform to deliver business-ready data to stakeholders. (Python, Django, JavaScript, NoSQL, REST APIs)
Май 2020Май 2021
1 год 1 месяц
Тендерхелп

Дубна (Московская область)

Информационные технологии, системная интеграция, интернет... Показать еще

Web-разработчик
- Provided comprehensive support and actively participated in the development of the company's website for the interaction of clients and tender agents, which allowed hundreds of clients to comfortably interact with agents daily (Python, Django, ORM, DRF, VueJS, MySQL) - Developed components for extracting data on the availability of tenders from open sources and uploading it to CSV format, which reduced data processing time from 3 hours of manual work to 15 minutes of parser work. (Python, Django, VueJS)

Навыки

Уровни владения навыками
Python
PySpark
Azure Databricks
Apache Airflow
Apache Spark
ETL
AWS S3
MySQL
NoSQL
Git
REST API
CI/CD
Data engineering
Spark
AWS
Kafka

Обо мне

Senior Data Engineer with a proven track record in designing, developing, and optimizing scalable ETL pipelines (batch and streaming) and big data platforms for efficient data integration, transformation, and consolidation. Expertise in data architecture, distributed systems, and performance tuning. Skilled in mentoring and onboarding team members.

Высшее образование (Магистр)

2019
Высшее образование (Магистр)
Системного анализа и управления, Геоинформационные технологии
2017
Высшее образование (Магистр)
Распределенных информационно-вычислительных систем, Программная инженерия

Знание языков

Русский — Родной

Английский — B2 — Средне-продвинутый

Повышение квалификации, курсы

2019
CERN School of Computing
CERN, Physics Computing, Software Engineering, Data Technologies.
2018
Coursera
Coursera, Introduction to machine learning. / Specialization - Machine Learning and Data Analysis.

Гражданство, время в пути до работы

Гражданство: Россия

Желательное время в пути до работы: Не имеет значения