Educational Qualification* - B.Tech, MCA, M.Tech
Experience Range: 8-10 years
Primary (Must have skills)* -
Overall Experience- 8+ years in IT with the most recent 2+ years in contribution to Architecture and Design
Microsoft Fabric- 1+ year hands-on experience with Microsoft Fabric on at least one customer engagement. Experience with OneLake, Lakehouses, Fabric Dataflows Gen2, Notebooks, and DirectLake. Ability to design integrated dataflow and lakehouse-based architectures including Medallion layer implementation. Exposure to Fabric Copilot for accelerating data modeling and report creation (good to have).
Azure Data Engineering- 5+ years of hands-on experience with Azure Data Engineering services: Azure Data Factory (ADF), Synapse Analytics, Azure Data Lake Storage (ADLS), and Azure Functions. Demonstrated experience designing ingestion pipelines (Veeva Vault API connectors, SFTP, flat-file), error handling, retry logic, dead-letter queues, and pipeline monitoring dashboards.
Databricks- 4+ years implementing ELT pipelines, notebooks, and ML-based data transformations on Databricks across large-scale distributed datasets.
Power BI- 2+ years delivering 4+ enterprise-grade BI projects using Power BI Pro, Premium, and Paginated Reports. Proficient in DAX, Power Query, M language, RLS, Incremental Refresh, and optimized semantic modelling for Gold-layer consumption.
Generative AI / Copilot- Hands-on experience leveraging GenAI tools (GitHub Copilot, Microsoft Fabric Copilot, M365 Copilot) to accelerate code development, pipeline generation, DAX/SQL authoring, metadata extraction, and documentation automation.
Business Requirement Translation- 2+ years translating business and regulatory requirements into technical specifications and delivering end-to-end BI and data solutions. Experience conducting design workshops with business and IT stakeholders.
Job Description of Role* (RNR) - To be Evaluated by Technical Panel (Define it to give more clarity)
Architecture & Design
Contribute to and execute solution architecture for a GxP-validated Cloud Data Platform on Microsoft Fabric and Azure.
Implement Medallion architecture (Bronze / Silver / Gold) using architect guidance.
Configure OneLake workspace separation across Dev / QA / UAT / Prod with RBAC/ABAC and VNet/private endpoint security.
Support authoring of Architecture Design Documents, Security Design specs, and Validation Plans.
Build reusable semantic (Gold-layer) models for Power BI and downstream analytics consumption.
Data Integration & Pipeline Architecture
Build and maintain ingestion pipelines: Veeva Vault API, flat-file CDM uploads, vendor SFTP/portal automation.
Implement Azure Function-based file validation and virus scanning workflows.
Apply error handling patterns, retry logic, dead-letter queues, and pipeline monitoring dashboards.
Ensure end-to-end data flow from source through Bronze → Silver → Gold layers.
Validation Framework & GxP Compliance
Implement deterministic validation rule engine across Bronze (schema/format/metadata), Silver (business rules/DQ), and Gold (certification workflows).
Support IQ / OQ / PQ execution and documentation.
Execute performance testing (100+ concurrent users, TB-scale datasets) and assist in security/penetration testing.
Ensure deliverables meet FDA 21 CFR Part 11 and GxP requirements.
Immutable Audit Logging & Electronic Signatures
Configure Azure Immutable Blob Storage (WORM) and log routing from Fabric, Azure Monitor, and Key Vault.
Implement retention policies, cryptographic hash chains, and archival tier setup.
Develop application logging for business events, transformation audit logs, access logs (R/Python/SQL/SAS), and vendor activity logs.
Configure electronic signature workflows with approval metadata, non-repudiation, and multi-approver support.
Build audit trail export utilities and FDA eCTD-ready report templates.
Versioning, Lineage & Data Governance
Implement dataset versioning with version comparison and rollback capabilities.
Build end-to-end data lineage tracking from source to Gold layer.
Apply RBAC/ABAC security framework, encryption standards, and audit governance aligned with HIPAA, GDPR, and 21 CFR Part 11.
Analytics & Reporting Platform
Configure Fabric notebook environments (Python, R, SQL, Scala) and SAS 9.4 / RStudio Azure VMs with OneLake mount.
Develop Power BI workspaces with study progress dashboards, DQ scorecards, and compliance dashboards.
Support report certification and distribution processes; maintain Power BI Report Catalogues.
Stakeholder Engagement & Delivery
Participate in architecture design workshops with Business and IT stakeholders.
Translate functional requirements into technical design specifications.
Support milestone sign-offs and coordinate with Data Management, Biostatistics, CDM, and Vendor teams.
CI/CD, DevOps & Monitoring
Implement and maintain CI/CD pipelines and monitoring frameworks for Azure data engineering workloads.
Use Azure DevOps / Jira for Agile/Scrum delivery, sprint planning, and backlog management.
Write KQL queries for forensic log analysis and real-time compliance monitoring.
Generative AI Acceleration
Use GitHub Copilot, Microsoft Fabric Copilot, and M365 Copilot to accelerate pipeline development, DAX/SQL authoring, metadata extraction, and documentation.
Follow and promote GenAI best practices within the delivery team.
Soft skills/other skills - To be Evaluated by Hiring Manager (To define how this will be evaluated)
Communication Skills:
Communicate effectively with internal and customer stakeholders
Communication approach: verbal, emails and instant messages
Interpersonal Skills:
Strong interpersonal skills to build and maintain productive relationships with team members
Provide constructive feedback during code reviews and be open to receiving feedback on your own code.
Problem-Solving and Analytical Thinking:
Capability to troubleshoot and resolve issues efficiently.
Analytical mindset
Task/ Work Updates
Prior experience in working on Agile/Scrum projects with exposure to tools like Jira/Azure DevOps
Provides regular updates, proactive and due diligent to carry out responsibilities
Expected Outcome
We are looking for a seasoned Technical Lead with 8+ years of experience in end-to-end data platform delivery for regulated industries (Life Sciences / Pharma).
. Strong customer engagement skills and hands-on expertise in Azure Data Engineering, Databricks, Power BI, and Microsoft Fabric are essential.
Secondary Skills to be planned Post Hiring - Training Plan
DevOps / DataOps Understanding of DevOps/DataOps practices including CI/CD pipelines, environment management, and deployment automation for data platforms.
Agile / Hybrid Delivery Familiarity with Agile or Hybrid delivery models and collaboration tools (Azure DevOps, Jira, Confluence).
Certifications (Good to Have) Relevant Azure / Data / AI Architect or Engineer certifications (e.g., DP-600 Fabric Analytics Engineer, DP-203 Azure Data Engineer, AZ-305 Azure Solutions Architect Expert).
Life Sciences Domain Deepening knowledge of life sciences clinical data flows, CDISC standards (SDTM/ADaM), and regulatory submission processes (FDA eCTD).