About this role
<div> <div> <p><span>Job Title:</span><span> Principal Data Platform Engineer</span><span> </span></p> </div> <div> <p> </p> <p><span>Location: </span><span>Malaysia</span><span> </span></p> </div> <div> <p> </p> <p><span>Job Description</span><span> </span></p> </div> <div> <p> </p> <p><span>Role Mission: </span><span>To lead and scale the </span><span>Data Platform Engineering and Operations</span><span> function within StarHub’s Digital Experience Platform (DXP) Data organization. This role ensures the continuous </span><span>reliability, scalability, and security</span><span> of StarHub’s C360 cloud data platform built on AWS, Snowflake, SageMaker, and Datapipe. The incumbent drives </span><span>operational excellence, automation, and engineering maturity</span><span> across the platform, while prototyping and rolling out </span><span>new platform capabilities</span><span> that enable agility, innovation, and performance for data and AI workloads across the enterprise.</span><span> </span></p> </div> <div> <p> </p> <p><span>Accountabilities: </span></p> </div> <div> <ol style="list-style-type:decimal" start="1"> <li> <p><span>Own end-to-end </span><span>infrastructure and platform operations</span><span> of the DXP Data Platform across AWS, Snowflake, and SageMaker environments (DEV, SIT, PROD).</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="2"> <li> <p><span>Lead the </span><span>design, build, and automation of data platform engineering and DevOps practices</span><span>, ensuring continuous improvement and zero-downtime operations.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="3"> <li> <p><span>Lead the prototyping, implementation, and rollout of </span><span>new platform capabilities and services</span><span> across AWS, Snowflake, and SageMaker.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="4"> <li> <p><span>Implement governance, </span><span>security, and compliance standards & improvements</span><span> for cloud infrastructure, data access, and network controls.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="5"> <li> <p><span>Drive operational excellence through </span><span>monitoring, alerting, cost optimization</span><span>, and performance tuning.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="6"> <li> <p><span>Manage a hybrid team</span><span> of internal platform engineers and vendor-augmented resources supporting Day 2 operations and enhancements.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="7"> <li> <p><span>Partner</span><span> with Data Engineering, Architecture, Security, Infrastructure & Tooling teams to ensure aligned technical roadmaps, compliance readiness, and audit traceability.</span><span> </span></p> </li> </ol> </div> <div> <p> </p> <p><span>Responsibilities:</span></p> </div> <div> <ol style="list-style-type:decimal" start="1"> <li> <p><span>Platform Engineering & Operations:</span><span> </span></p> </li> </ol> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Own AWS infrastructure for DXP Data Platform (multi-AZ, multi-VPC setup).</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Administer Snowflake environments, including user roles, RBAC, performance optimization, warehouse lifecycle, and cost controls.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Manage SageMaker environments (Studio, Canvas, Notebooks) for enabling multi-domain ML use cases.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Operate DataPipes (EKS + Airflow) for ingestion orchestration, ensuring high availability and version-controlled configurations via IaC (CloudFormation/CDK).</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Maintain & enhance logging, monitoring, observability & DevOps automation via modern tools such as CloudWatch, Splunk, PagerDuty, Slack, ServiceNow, Snowflake observability features.</span><span> </span></p> </li> </ul> </div> <div> <ol style="list-style-type:decimal" start="2"> <li> <p><span>Platform Prototyping & Enhancement:</span><span> </span></p> </li> </ol> </div> </div> <div> <div> <ul style="list-style-type:circle"> <li> <p><span>Design, prototype, and implement new platform features across AWS, Snowflake, and SageMaker to support innovation in data processing, analytics, and ML operations.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Lead rollout and production hardening of new platform components (e.g., new Snowflake Cortex AI features, SageMaker pipelines, AWS-native services).</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Evaluate and integrate new services or capabilities aligned to StarHub’s data platform roadmap.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Develop technical design standards and documentation for new features and automation processes.</span><span> </span></p> </li> </ul> </div> <div> <ol style="list-style-type:decimal" start="3"> <li> <p><span>Automation & Continuous Improvement:</span><span> </span></p> </li> </ol> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Implement Day 2 platform enhancements including auto-scaling, self-healing workflows, and CI/CD automation.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Enhance EKS cluster performance, pipeline automation, and integration efficiency across cloud and on-prem data sources.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Drive infrastructure-as-code (IaC) adoption for all environments and standardize rollout strategies (blue-green/canary).</span><span> </span></p> </li> </ul> </div> <div> <ol style="list-style-type:decimal" start="4"> <li> <p><span>Governance, Security & Compliance:</span><span> </span></p> </li> </ol> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Enforce enterprise security policies for IAM, VPC isolation, PrivateLink, and encryption (KMS, Secrets Manager).</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Coordinate vulnerability remediation (EKS upgrades, CVE patching, EC2 AMI refresh, Docker image hardening).</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Ensure infrastructure audit readiness in partnership with Information Security (ITSec) and Compliance teams.</span><span> </span></p> </li> </ul> </div> <div> <ol style="list-style-type:decimal" start="5"> <li> <p><span>Operations & Cost Management:</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:lower-alpha" start="1"> <li> <p><span>Monitor and optimize Snowflake warehouse utilization, compute spend, and S3 data lifecycle management.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:lower-alpha" start="2"> <li> <p><span>Maintain tagging, dashboards, and cost visibility frameworks across AWS and Snowflake.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:lower-alpha" start="3"> <li> <p><span>Implement cost governance guardrails and usage quotas across platform components.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="6"> <li> <p><span>Team & Vendor Leadership:</span><span> </span></p> </li> </ol> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Lead a blended team of StarHub engineers and partner vendors responsible for platform sustainment and evolution.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Manage augmented vendor teams, ensuring consistent delivery quality and knowledge transfer to internal engineers.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Build in-house capability in platform engineering, IaC, and automation disciplines.</span><span> </span></p> </li> </ul> </div> </div> <div> <div> <p><span> </span></p> </div> <div> <p><span>Areas of Impact: </span></p> </div> <div> <ol style="list-style-type:decimal" start="1"> <li> <p><span>Scope</span><span>: DXP Data Platform infrastructure (AWS, Snowflake, SageMaker, Datapipe) supporting C360, AI/ML, and analytics workloads enterprise-wide.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="2"> <li> <p><span>Decision Rights</span><span>: Platform design approval, selection of new AWS/Snowflake/SageMaker capabilities, DevOps tooling and IaC framework choices, and vendor performance oversight.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="3"> <li> <p><span>Stakeholders</span><span>: Platform Engineering, Data Engineering, Data Science, Architecture & Governance, Information Security, and Infrastructure teams.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="4"> <li> <p><span>Resources</span><span>: Hybrid team of ~2-4 engineers (StarHub and partner resources) managing infrastructure, platform automation, enhancements, and operations.</span><span> </span></p> </li> </ol> </div> <div> <p><span> </span></p> </div> <div> <p><span>Ideal Track Record: </span><span> </span></p> </div> <div> <ol style="list-style-type:decimal" start="1"> <li> <p><span>8-10</span><span> years of experience in cloud and platform engineering, with extensive experience on AWS-based data platforms.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="2"> <li> <p><span>Proven leadership of cross-functional engineering teams managing production-grade, multi-environment platforms.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="3"> <li> <p><span>Hands-on expertise in:</span><span> </span></p> </li> </ol> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>AWS Services</span><span>: VPC, EC2, S3, RDS, Lambda, KMS, CloudFormation/CDK, Transfer Family, CloudWatch, CloudTrail.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Snowflake</span><span>: administration, RBAC, warehouse optimization, DevOps automation, Cortex AI, and Streamlit integration.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>EKS / Airflow / Airbyte (Datapipe):</span><span> container orchestration, CI/CD pipelines, and deployment automation.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>SageMaker</span><span>: multi-domain setup, pipeline management, Studio/Canvas lifecycle, and MLOps enablement.</span><span> </span></p> </li> </ul> </div> <div> <ul style="list-style-type:circle"> <li> <p><span>Monitoring & Observability</span><span>: CloudWatch, Splunk, Snowflake Account Usage, cost dashboards, PagerDuty, Slack, ServiceNow.</span><span> </span></p> </li> </ul> </div> <div> <ol style="list-style-type:decimal" start="4"> <li> <p><span>Demonstrated success in prototyping, implementing, and scaling new cloud and data platform features into production.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="5"> <li> <p><span>Experience managing Day 2 operations, incident response, and SRE-driven performance stabilization.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="6"> <li> <p><span>Familiarity with machine learning integration and model lifecycle management.</span><span> </span></p> </li> </ol> </div> <div> <ol style="list-style-type:decimal" start="7"> <li> <p><span>Experience enforcing ITSec and compliance standards (IAM, KMS, PDPA/GDPR).</span><span> </span></p> </li> </ol> </div> </div> <div> <div> <ol style="list-style-type:decimal" start="8"> <li> <p><span>Proven success in transitioning platform operations from vendor-managed to in-house ownership.</span><span> </span></p> </li> </ol> </div> </div>