3d ago

Technical Product Manager

This is a Staff Product Manager role focused on Generative and Agentic AI, requiring deep fluency in LLMs and agent orchestration. The PM will own the end-to-end product lifecycle for GenAI solutions within the AWS ecosystem, including building evals, tools, and reference architectures. This role requires technical depth, platform ownership, and experience shipping AI products from prototype through production at scale.

Seniority

Staff

Product Area

ai/ml

Work Style

Remote

Location

Role type

AI Native
Technical PM
Platform PM

Skills

Required

GenAI fluency
agentic AI
LLMs
RAG
prompt engineering
context engineering
evals
Cursor
Claude
Copilot
AWS AI ecosystem
PRDs

Nice to have

Python
JavaScript
TypeScript
Financial Services
Healthcare
Life Sciences
Llama
Mistral

Full job description

Location: US - Remote
Department: Delivery

At Robots & Pencils, we build meaningful, scalable digital products that solve real business problems. We are looking for a Staff Product Manager who combines deep Generative and Agentic AI fluency with hands-on building ability to own AI product outcomes end-to-end. As a Staff PM, you're accountable for initiative-level outcomes, stakeholder satisfaction, and contributing to R&P's AI product practice. You think in systems, work backwards from the customer problem, and stay relentlessly curious about what's next in AI.

Enterprise clients want to deploy Agents - moving from a promising demo to a production system that works at scale, meets security and compliance requirements, and delivers measurable business value is hard. This role owns that problem. You'll be part of a GenAI initiative within the AWS ecosystem, building the evals, tools, patterns, and reference architectures that make AI deployment repeatable. The mindset: prove it works, test assumptions early, and document while building.

Key Responsibilities

Product Strategy & AI Vision

Define and drive the product vision, strategy, and roadmap for GenAI solutions - with agentic AI (agent orchestration, tool use, multi-step workflows) as the primary focus - connecting AI capabilities to enterprise business outcomes
Translate enterprise problems into structured product requirements; reframe feature requests into outcome-driven priorities with explicit tradeoffs on invest in vs. defer
Balance near-term deployment milestones with long-term platform scalability and sustainability
Monitor the competitive GenAI landscape and emerging agentic patterns to inform roadmap and technology decisions

Discovery & Validation

Research how enterprise users interact with AI agents and where they lose trust; frame the riskiest assumptions as testable hypotheses and de-risk them first
Design and run experiments - POCs, pilot deployments, scenario-based testing of multi-step workflows, edge cases, and failure recovery - to validate agentic solutions where non-deterministic output makes traditional QA insufficient
Distill research, experiments, and competitive intelligence into clear insights that pave the path for a successful product

Agent Design, Prototyping & Production

Define agent behavior and prototype system prompts and tool schemas; partner with engineering on context management - summarization, working memory, and information flow across multi-step tasks
Drive multi-model architecture tradeoffs with engineering - define the quality, cost, and latency targets that determine which model serves each step in the agent workflow
Build AI prototypes to validate hypotheses; define human-in-the-loop boundaries and guardrails - when the agent acts autonomously, when it escalates, and how to handle non-deterministic output
Establish agent evaluation frameworks - task completion, reasoning quality, tool selection, failure recovery, safety - and partner with engineering on production readiness (observability, drift, responsible AI, prompt versioning)
Define success metrics at the agent level - task completion rate, cost per task (not per inference), escalation rate, time to resolution, and customer trust alongside business KPIs

Delivery & Execution

Own the end-to-end product lifecycle from discovery through phased rollouts; establish the metrics framework (north star, input, guardrail metrics) and report product impact to leadership
Manage the product backlog, scope, dependencies, and risks; drive agile ceremonies and produce high-quality PRDs, product briefs, and decision logs
Evaluate technology and platform decisions from a product perspective; create deployment playbooks, reference architectures, and knowledge transfer materials so teams sustain solutions independently
Use AI to accelerate product work - research, analysis, prototyping, documentation - with judgment on when it needs human oversight; onboard rapidly to new domains and support team members across the initiative

Stakeholder Management

Build trusted relationships with stakeholders and executives; serve as the go-to product advisor and primary contact for AI product direction and deployment strategy
Partner with AWS Solution Architects and account teams to align on technical approach, service selection, and go-to-market for GenAI solutions
Manage expectations on scope, timelines, and tradeoffs; facilitate decisions across competing priorities using data, alternatives, and clear rationale
Frame AI capabilities and limitations for non-technical stakeholders - manage hype cycles, set realistic expectations; surface unmet needs that deepen relationships and grow the account

Required Skills

8-12+ years in product management, forward deployment, or solutions engineering; must have shipped AI products from prototype through production at scale
Strong product sense - ability to identify what matters to users and the business, make prioritization calls with incomplete information, and shape products that deliver real outcomes
Deep GenAI fluency - LLMs, RAG, fine-tuning, prompt engineering, context engineering, evals - with hands-on experience building or shipping agentic systems (planning, tool use, HITL, guardrails)
Proven ability to prototype AI solutions using AI tools (Cursor, Claude, Copilot) to validate hypotheses and de-risk product decisions
Experience deploying AI solutions in enterprise environments with strong technical fluency - can read code, evaluate architectures, make product tradeoffs on technical constraints, and drive scalable deployment patterns
Exceptional communicator - clear PRDs, technical specs, and decision logs; has led AI products through full lifecycle and driven alignment with Directors, VPs, and C-level
Comfortable operating in ambiguous, fast-moving environments where the AI landscape evolves weekly
PM-level fluency across the AWS AI ecosystem - Bedrock, AgentCore, SageMaker, Strands, Kendra, OpenSearch, Lambda, Step Functions - to make informed product and architecture decisions

Preferred Qualifications

Software engineering or coding background (Python, JavaScript, TypeScript)
Agency or consulting delivery experience
Experience in Financial Services, Healthcare, or Life Sciences industries
Familiarity with open-source LLM ecosystem (Llama, Mistral) for flexibility and cost optimization
Prior experience leading time-boxed discovery initiatives or technical spikes with rapid validation cycles

Why Join R&P?

You'll work at the intersection of cutting-edge AI and real enterprise impact - helping clients deploy Generative and Agentic AI solutions that change how their businesses operate. R&P gives you the variety of consulting (new problems, new industries, new tech) with the depth of a product role - you'll build, ship, and measure, not just advise. The team is collaborative, technically sharp, and genuinely invested in doing great work for clients.

About Robots & Pencils

Robots & Pencils is a digital consulting firm that helps Fortune 100s and growth-stage companies modernize legacy systems and deploy AI agents into operations in 30-45 days. The team blends cloud-native architecture (AWS Advanced Tier and Pattern Partner), human-centered design, data readiness work, and agentic orchestration across education, financial services, healthcare, high tech, and retail. Founded in 2009 in Calgary by Michael Sikorsky, the firm was recapitalized by Next Sparc in 2017, merged with KINETiQ DIGITAL, and later took a 2022 strategic investment from Salesforce Ventures. The team is remote-first across the US and Canada, with parental leave (birth, adoptive, and spouse) and flexible PTO including sick, personal, and flexible holidays.

AI & Machine Learning
Enterprise Software
Parental leave
Flexible hours
