
Tech C2C Jobs
Job Title: Generative AI Solution Architect
Location: Torrance, California (onsite)
Job Summary
Mandatory skills: Architectural diagrams
Key responsibilities:
Design multilayered AI solutions balancing compute efficiency, contextual fidelity, and algorithmic adaptability (retrieval, reasoning, planning, tool use)
Develop advanced RAG pipelines leveraging vector databases (e.g., ChromaDB, Milvus, FAISS) and embedding strategies for contextual accuracy
Integrate AI capabilities with enterprise systems via REST and GraphQL APIs, ensuring secure and scalable interoperability
Establish best practices for algorithm selection and layering, combining neural models, symbolic reasoning, and tool-based agents for optimal performance
Collaborate with cross-functional teams to embed AI agents into business workflows and align with compliance and governance standards
Implement structured output validation and schema enforcement using Pydantic, FastAPI, and JSON Schema for robust data integrity
Optimize compute resources and latency tradeoffs across cloud, hybrid, and edge environments for high-performance AI workloads
Define observability baselines (telemetry, tracing, evaluation metrics) and rollout strategies for safe, iterative deployments
Required skills:
Programming & Architecture: Python 3.10+, async design, modular distributed architecture, microservices
LangChain Ecosystem: LangChain, LangGraph, prompt templates, agent orchestration patterns
AI/ML Frameworks: OpenAI API, HuggingFace Transformers, TensorFlow, PyTorch; experience with fine-tuning and inference optimization
Data & Context Management: SQLAlchemy, PostgreSQL, JSON Schema mapping, feature engineering, and contextual pipelines
Vector Databases & Semantic Search: ChromaDB, Milvus, FAISS; embedding optimization and similarity search strategies
Algorithmic Design: Strong understanding of algorithm layering (retrieval, reasoning, planning), hybrid AI approaches, and compute-aware model selection
API Integration & Security: REST, GraphQL, OAuth, enterprise-grade security practices
DevOps & CI/CD: Git, Docker, Azure DevOps or equivalent, FastAPI, Uvicorn, containerized deployments, and automated pipelines
Agentic Capabilities: Reasoning, adaptation, tool calling, MCP-based solutions, ReAct agents, supervisor multi-agent coordination
Performance Optimization: Distributed compute strategies, GPU/TPU utilization, quantization, pruning, distillation, caching, batching
To apply for this job, email your details to rahul.pandey@quantumworldit.com