Solution Architect – AI Infrastructure & Private Cloud
Bengaluru, Karnataka, India முழு நேரம்
முதல் ஆளாக விண்ணப்பிக்கவும்
- அனுபவம்
- ஏதேனும்
- சம்பளம்
- —
- காலியிடங்கள்
- 1
- பதிவுசெய்யப்பட்டது
- 2 நாட்கள் முன்
Where you'll work
பணி விளக்கம்
About Aziro
Aziro follows a distinct operating philosophy known as the Aziro Way, which guides how the company innovates, collaborates, runs its business, and creates value for clients. The organization specializes in IT services and domain-focused automation software, with strong capabilities in storage, servers, disaster recovery/business continuity, and virtualization. In addition to services, Aziro provides enabling products, including end-to-end domain-specific software solutions for test automation. With a vision to be at the center of industry transformation, Aziro aims to act as a catalyst for constructive change and improved software and business processes.
Role Overview
The organization is looking for an experienced Solutions Architect with strong depth in AI/ML infrastructure, high-performance computing, and container platforms. This role focuses on designing, deploying, and improving private cloud environments for HPE Private Cloud AI and enterprise AI factory solutions. The position is central to building scalable, secure, and high-performance AI platforms using HPE GreenLake and NVIDIA AI Enterprise technologies, along with validated HPE reference architectures and partner ecosystems.
Key Responsibilities
- Provide delivery assurance and act as the primary design authority for enterprise container platforms, private cloud AI environments, and HPC/AI solutions.
- Align architecture decisions with customer AI/ML strategy, business goals, and NVIDIA Enterprise AI Factory design principles.
- Plan and manage risk, stakeholder expectations, and delivery activities across the full project lifecycle.
- Design and optimize solutions across container orchestration and HPC workload management using tools such as Red Hat OpenShift, SUSE Rancher, Slurm, and Altair PBS Pro.
- Integrate container and AI platforms with NVIDIA AI Enterprise, DevOps tooling, AI/ML frameworks, and open-source ecosystem components.
- Handle technical participation in RFPs, RFIs, and customer-facing solution discussions.
- Lead proof-of-concept engagements to confirm feasibility, integration, and performance in customer environments.
- Assess customer infrastructure and workload requirements and recommend appropriate configurations based on validated HPE and partner reference architectures.
- Stay updated on advancements in HPC, Kubernetes, hybrid cloud, security, and related infrastructure technologies.
- Advise enterprise customers by translating technical capabilities into clear business value.
- Work with infrastructure specialists and data science teams to deliver integrated solutions.
- Guide and mentor technical consultants and contribute to knowledge-sharing forums such as tech talks and innovation sessions.
Required Experience and Technical Background
- Strong hands-on expertise in HPC systems, AI infrastructure, and cluster workload schedulers such as Slurm and/or Altair PBS Pro.
- Practical experience with HPC cluster management tools such as HPE Cluster Management and/or NVIDIA Base Command Manager.
- Good understanding of high-speed networking, including InfiniBand, Mellanox, and Ethernet, along with performance tuning of HPC components.
- Extensive containerization experience with Docker, Podman, and Singularity.
- Working knowledge of at least two container orchestration platforms, such as CNCF Kubernetes, Red Hat OpenShift, SUSE Rancher, or Canonical Charmed Kubernetes.
- Understanding of GPU enablement and monitoring technologies such as NVIDIA GPU Operator and DCGM.
- Strong Linux administration skills, including package handling, boot troubleshooting, system tuning, and network setup.
- Hands-on experience with at least two Linux distributions among RHEL, SLES, and Ubuntu.
- Exposure to virtualization technologies such as KVM and OpenShift Virtualization.
- Knowledge of hybrid cloud architecture and major cloud platforms working alongside on-premises systems.
- Familiarity with DevOps methods, including CI/CD, infrastructure as code, and microservices delivery.
- Experience integrating open-source AI/ML tools and supporting the full model lifecycle from development through deployment.
- Understanding of cloud-native security, observability, and compliance for reliable AI/ML operations at scale.
- Solid grasp of networking fundamentals such as DNS, TCP/IP, routing, and load balancing.
- Working knowledge of data and storage protocols such as S3, NFS, and SMB/CIFS.
- Proficiency in Python and Bash scripting, with experience automating infrastructure and AI workflows.
- Excellent problem-solving, analytical, and communication skills for both technical and non-technical audiences.
Eligibility
Any graduate can apply for this position.
Additional Information
This role is based in Bengaluru, India. The posting does not specify a stipend or salary, number of openings, start date, application deadline, or work schedule details.