💻 SELECT insight FROM chaos WHERE clarity = 'engineered'
Author
Big Data Engineer with 3+ years of experience designing and optimizing batch and real-time data pipelines, improving petabyte-scale data warehouse performance, and delivering insights with measurable commercial impact. Proficient in big data frameworks including Spark, Flink, and Kafka; experienced with cloud platforms such as Alibaba Cloud, Google Cloud, and Amazon Web Services; and skilled in dimensional modeling, with a strong focus on analytics efficiency and governance.
Work Experience
-
🏢 Tamira Tech, Singapore — Data Warehouse Engineer (Full-time)
02/2026 – Present
- Built real-time and offline data pipelines for image search models, improving recommendation satisfaction and raising user NPS (+3%).
- Unified commercial and core metrics pipelines, developed AB testing datasets, and improved analysis efficiency (+25%).
- Reconstructed algorithm data warehouse models, boosting BI efficiency (+30%) and reducing maintenance time (-10%).
- Developed Spark log analysis tools to pinpoint resource bottlenecks, improving optimization efficiency (+50%).
-
🏢 Poizon (DeWU), Shanghai — Data Warehouse Engineer (Full-time)
07/2024 – 02/2026
- Built real-time and offline data pipelines for image search models, improving recommendation satisfaction and raising user NPS (+3%).
- Unified commercial and core metrics pipelines, developed AB testing datasets, and improved analysis efficiency (+25%).
- Reconstructed algorithm data warehouse models, boosting BI efficiency (+30%) and reducing maintenance time (-10%).
- Developed Spark log analysis tools to pinpoint resource bottlenecks, improving optimization efficiency (+50%).
-
🏢 Poizon (DeWU), Shanghai — Big Data Developer (Intern)
05/2023 – 09/2023
- Migrated and optimized 500+ big data tasks on the Galaxy platform, improving execution efficiency (+20%).
- Ensured post-migration data accuracy using SQL & Python; supported search algorithm warehouse model design.
-
🏢 NetEase Cloud Music — Big Data Developer (Intern)
09/2022 – 04/2023
- Designed core metrics for user and behavior analysis, contributing to strategy development.
- Optimized event tracking and introduced cold data archiving, reducing storage costs (-15%).
Project Experience
-
EMR Spark Task Performance & Error Analysis Tool — Feature Development
02/2025 – 04/2025
- Developed automated log parsing tools for performance bottlenecks and error localization.
- Created analysis modules that significantly sped up debugging and optimization.
-
Image Search Evaluation & NPS Feedback Mechanism — Development
11/2024 – 12/2024
- Designed a batch sampling platform for algorithm evaluation, saving 31 person-days per month.
- Built an NPS feedback mechanism pipeline to enhance user experience evaluation.
-
Poizon Push New Product Commercialization — Data Development
08/2024 – 10/2024
- Developed 15 ADS reports and established key commercial metrics (PVR, ASN), achieving SLA (+97%).
- Supported data pipeline enhancements, ensuring stable and scalable operations.
Skills
- Data Warehousing & Modeling: Expert in dimensional modeling (star/snowflake), designing PB-scale data solutions.
- Big Data Tools: Proficient in Spark, Flink, Kafka, Hadoop, and distributed frameworks.
- Programming Languages & Databases: Java, Python, and SQL for big data analysis; MySQL and PostgreSQL for high-performance queries.
- Data Governance: Skilled in pipeline orchestration, metadata management, and data monitoring.
- Soft Skills: Agile development, cross-team collaboration, and clear technical documentation.
Privacy and Comments
This website does not track visitor behavior or require sensitive personal information (e.g., real names or phone numbers).