AI Architect
AI Architect – PC Level
About the team:
At Capco, we are dedicated to the financial services industries. Our professionals combine innovative thinking with unrivalled industry and domain expertise to offer our clients consulting expertise, complex technology and package integration, transformation delivery, and managed services, to move their organizations forward. Through our collaborative and efficient approach, we help our clients successfully innovate, increase revenue, manage risk and regulatory change, reduce costs, and enhance controls. Our teams stay at the forefront of industry trends and technologies that are driving innovation. From strategy to launch, we are adept at delivering across the full product lifecycle.
About the Job:
As a member of the Capco GenAI Technology Delivery Team in India, you’ll bring practical knowledge of agile development methodologies and engineering best practices. As an AI Architect, you’ll play an integral role, using your experience and skills to contribute to the quality and implementation of our projects, where you will design, govern, and continuously improve end-to-end GenAI solutions.
What you’ll do
- Architect and deliver GenAI chat experiences
- Design scalable, low-latency solutions, using your understanding of the structure of various backend systems (authentication, authorization, enterprise databases, enterprise APIs, knowledge bases, etc.) in the context of an end-to-end solution.
- Assess end-to-end bot design and performance.
- Choose and combine LLM access patterns (direct API, fine-tuned models, RAG) to meet accuracy, cost, and governance targets.
- Develop chatbot solutions for knowledge management and self-service by applying a solid understanding of chatbot architectural principles and invoking APIs for validation, request completion, etc.
- Translate business needs into working solutions aligned with enterprise architecture.
- Embed Retrieval-Augmented Generation (RAG)
- Own schema design for vector stores (Pinecone, FAISS, pgvector, OpenSearch, etc.), chunking strategies, and embedding selection.
- Drive LLMOps / MLOps
- Define CI/CD pipelines for prompts, models, and evaluation using MLflow, SageMaker Pipelines, Kubeflow, or equivalent.
- Instrument solutions with OpenTelemetry and automated eval suites for hallucination, toxicity, bias, and drift.
- Fine-tune & align models
- Apply parameter-efficient tuning (LoRA, QLoRA, Adapters) or RLHF where prompting alone falls short; perform offline and online A/B tests.
- Ensure privacy, security & compliance
- Embed DPDPA, GDPR, and SOC 2 controls: PII redaction, encryption in transit and at rest, RBAC, and secure prompt logging.
- Champion Responsible-AI principles: fairness, explainability, auditability.
- Optimize performance & cost
- Profile and tune concurrency, GPU/CPU utilization, and response-time budgets; recommend caching, quantization, and batching while maintaining high accuracy and security.
- Lead & mentor
- Run architecture reviews, code walkthroughs, and knowledge-sharing for engineers across India and global locations.
- Liaise with product owners and stakeholders to translate business goals into actionable architecture roadmaps.
What you’ll bring
- 10+ years building distributed/cloud systems, 3+ years in conversational AI or GenAI.
- Deep expertise with Python, Node.js, and Java, including a strong API-first design mindset (REST/gRPC/GraphQL).
- Hands-on experience with any two of AWS, Azure, or GCP, plus Docker, Kubernetes, and IaC (Terraform, AWS CDK).
- Proven delivery using LangChain, LlamaIndex, Hugging Face Transformers, and at least one vector store (Pinecone, FAISS, pgvector, OpenSearch, etc.).
- Demonstrated MLOps / LLMOps practice: CI/CD for models/prompts, automated testing, canary & rollback, lineage tracking.
- Experience fine-tuning or aligning large models (LoRA, QLoRA, PEFT, RLHF) and benchmarking them.
- Track record of implementing observability, cost guardrails, and safety guardrails for toxicity and hallucination.
- Understanding of end-to-end chatbot solution patterns and related trade-offs.
- Familiarity with enterprise compliance frameworks (DPDPA, GDPR) and security architecture.
- Strong communication & leadership skills, able to influence across functions and mentor engineers.
- Ability to apply current trends and best practices in AI/ML.
- Bachelor's degree in computer science, engineering, information technology, artificial intelligence, or a related field (or equivalent professional experience).
Why Capco?
A career at Capco is a chance to help reshape the competitive landscape in financial services. We launch new banks, transform existing ones, and help our clients navigate complex change. As consultants, we work on the front-end business design all the way through to technology implementation.
We are the largest Financial Services focused consultancy in the world, serving everyone from global banks to emerging FinTechs, from strategy through digital transformation, design, business consulting, data and analytics, cyber, cloud, technology architecture, and engineering.
Capco is a young and growing firm. We maintain an entrepreneurial spirit and growth mindset, and have minimal bureaucracy. We have no internal silos that get in the way of your career opportunities or ability to focus on our clients and make a difference to the business.
We offer the opportunity for everyone to learn rapidly, take on tough challenges, and get promoted quickly. We take pride in our creative, collaborative, diverse, and inclusive culture, where everyone can #BYAW.
Ready to Take the Next Step?
If this sounds like you, we would love to hear from you. This is an opportunity to make a difference and contribute to a highly successful company with a significant growth trajectory.