Shoreline Episodes Ep. 3

EP 003 Season 01 · Token economics

Token maxing and the company brain.

A practical guide to organizing company knowledge for AI, choosing the right model and coding tool for each job, managing token costs, and using human judgment to protect quality.

May 31, 2026 39:35 18 chapters Transcript-derived Tool comparisons

The gist

Episode 3 asks what becomes possible when an AI system understands how a company works. The hosts compare two directions: several specialized agents, or one broader agent that can work across support, engineering, email, content, strategy, and operations using shared company knowledge.

They also examine “token maxing”: spending more AI computing power to explore ideas and solve difficult problems. That investment can create value when it serves a clear outcome and includes careful review. It becomes wasteful when teams reward token use itself. These notes turn the discussion into practical steps for organizing context, choosing models and coding tools, prioritizing useful work, and reviewing AI output with care.

Key takeaways

Eight practical ideas to put to work.

01

A company brain needs structure and boundaries.

Collecting documents is not enough. An agent needs current, organized knowledge about roles, systems, values, customers, and what it may read or change.

02

Measure value, not token use.

Exploration can uncover valuable ideas, but usage quotas can reward waste. Judge AI work by the result, learning, and time saved.

03

The coding environment shapes the result.

Codex, Claude Code, Cursor, and routing tools give models different context, permissions, and review workflows. Those differences affect the finished work.

04

Shared context can reduce app switching.

An agent that can safely work across email, documents, code, and CRM data may let people complete routine tasks from fewer interfaces.

05

Match the model to the task.

Use lower-cost models for extraction and early drafts. Reserve stronger models for difficult debugging, architecture, judgment, final review, and high-risk changes.

06

Content can teach company culture.

Prism's founder clips do more than market the business. They show the company's taste, values, product principles, and expectations in public.

07

Start with work that is easy and valuable.

When too many projects compete for attention, score each one from 1 to 3 for ease and value. Begin with the ideas that score 3 on both.

08

Human review protects quality.

People may not name every weak detail, but they notice careless work. AI increases output; clear standards and thoughtful review keep it useful.

Tactical guidesWhat to try next

Guide 01

Map the knowledge your AI needs.

Before building a broad company agent, define the information it needs and the actions it may take.

Separate shared company knowledge from information used by a specific team or function.
Shared knowledge can include the mission, customers, offers, pricing rules, brand voice, values, and decision principles.
Team knowledge can include support scripts, engineering docs, content formats, CRM fields, analytics reports, and recurring processes.
Label each source as read-only, draft-only, approved for changes, or off-limits.
Set a review date for every source so outdated strategy and customer notes do not mislead the agent.

Guide 02

Choose models with a simple rubric.

Spend more on AI reasoning only when the value or risk of the task justifies it.

Label each task by value, risk, reversibility, and review cost.
Use lower-cost or local models for summaries, data extraction, tags, and first drafts.
Use the most capable models for architecture, strategic judgment, difficult debugging, sensitive communication, and final review.
Record the model used, time saved, output quality, review time, and next action.
Stop low-value loops, but give promising, high-value work enough room to succeed.

Guide 03

Compare coding agents on the same job.

Choose tools with a repeatable test. Give Codex, Claude Code, Cursor, and any model router the same real task.

Pick one task with a clear finish line: fix a bug, draft a report, update a page, or triage email.
Run the task in each tool with the same starting context and constraints.
Score setup time, context handling, file edits, command execution, citations, tests, and ease of review.
Count both model costs and the time a person spends reviewing the work.
Keep the tool that makes the finished work easiest to trust.

Guide 04

Move one routine into an agent.

Test the episode's idea by completing one repetitive workflow without moving through several app screens.

Choose a repetitive app workflow, such as email triage, CRM cleanup, meeting prep, or content clipping.
Describe the desired result in one sentence: a ranked queue, draft, summary, or decision memo.
Give the agent access only to the information it needs, then produce the result from one interface.
Review the output before sending messages, changing records, or taking any customer-visible action.
Keep the workflow only if it reduces app switching without hiding important details.

Guide 05

Manage AI experiments as a portfolio.

AI makes small ideas easier to test. Give each experiment a clear limit and a useful output.

Capture an idea you would normally skip because it seems too small or difficult to start.
Give it a time limit, token budget, owner, and clear deliverable.
Build the smallest visible version: page, script, brief, prototype, report, or content format.
Score it on value, ease, timing, customer relevance, and whether it teaches the business something.
Develop the strongest experiments, archive the rest, and save what you learned.

Guide 06

Turn content into company culture.

The Prism example shows how public content can also document the company's principles for employees and agents.

Define the audience and the first idea or feeling each piece should communicate.
Design a repeatable format that is efficient to produce and valuable to the viewer.
Tag each post by principle: product taste, competition, discipline, customer service, speed, or craft.
Reuse the strongest posts in onboarding, hiring, agent instructions, and brand guidance.
Review successful formats to learn what resonated and which lessons should become repeatable guidance.

Guide 07

Prioritize work by ease and value.

Use the episode's simple 1-to-3 scoring system to decide what to do first.

List all candidate features, automations, content ideas, or internal improvements.
Score ease from 1 to 3: how fast, cheap, and reversible is it?
Score value from 1 to 3: how much does it help customers, revenue, learning, or velocity?
Start with ideas that score 3 for both ease and value. Then review the next-highest combinations if capacity remains.
Ask AI to challenge the scores and suggest lower-friction versions of hard ideas.

Guide 08

Create a quality-review loop.

People notice careless work even when they cannot explain what feels wrong. Make human judgment part of the process.

Review AI output at the level of intent, structure, language, visual hierarchy, and tiny details.
Ask what a discerning reviewer would notice after five seconds, five minutes, and five uses.
Remove generic filler, invented details, bloated interfaces, weak claims, and unchecked assumptions.
Compare the result with your best prior work, brand system, founder principles, or customer examples.
Make the agent record what was changed so the next output starts closer to the mark.

ChaptersTimestamp map

00:00

Company knowledge and broad AI agents

The hosts ask how AI can use shared company knowledge across support, engineering, and operations.

↗ 01:17

More context, fewer specialized agents

As models improve, the hosts consider whether one well-informed agent could replace several narrowly focused ones.

↗ 03:29

OpenRouter and model costs

Will asks whether companies are paying for advanced models when lower-cost options could handle the task just as well.

↗ 04:27

Reducing app switching

Enzo explains how Codex and Telegram can become shared interfaces for work that once required several separate apps.

↗ 07:08

Cursor versus Claude Code versus Codex

The hosts compare tools built for a specific model with editors that support several models, then ask which setup produces the most trustworthy result.

↗ 08:14

Why model makers build their own tools

A Tesla analogy explains why model developers may be well placed to design the interface around their own technology.

↗ 11:42

Simplicity outside, complexity underneath

A Tesla interior becomes the metaphor: the best product surface can look simple because the complex system is hidden underneath.

↗ 12:42

Computing limits and a Tesla thought experiment

The hosts speculate about using parked cars for computing power, then return to the cost of turning electricity into AI output.

↗ 16:30

Token use and company incentives

The hosts distinguish useful experimentation from usage quotas that reward spending without creating business value.

↗ 17:37

Naval, Vercel, and founder experimentation

The conversation turns to the kinds of projects founders now try because AI lowers the friction enough to make them testable.

↗ 19:09

Turning more ideas into testable work

AI lowers the cost of testing ideas that once required an employee, contractor, or internal team.

↗ 21:37

Care improves the result

People get more value from AI when they ask specific questions, review the output, and connect it to a meaningful next step.

↗ 22:20

Testing more creative ideas

Most ideas will remain small, but lower testing costs make it practical to explore more of them.

↗ 23:34

Subscription pricing changes behavior

Once a person has paid for a plan, the mindset shifts toward extracting value from the quota instead of fearing each prompt.

↗ 25:03

Prism's repeatable Instagram format

Enzo explains how the team looks for content that is efficient to make and genuinely useful to its audience.

↗ 28:15

Ranking features by ease and value

Will applies the content lesson to product work: prioritize features by how easy they are to build and how much value they create.

↗ 30:31

Content as business culture

The Michelin and Stripe Press examples frame content as a way to educate a market and express a company's values.

↗ 35:00

Founder-athletes, AI coaching, and mastery

The closing stretch connects AI to training, health, jiu-jitsu, pole vaulting, Robert Greene, Josh Waitzkin, and transferable mastery.

↗ 38:56

Why human review still matters

The final point: AI output can work technically while still feeling careless, generic, or poorly considered.

↗

The stackTools & concepts

OpenRouter + token routing

Provider choice · token economics

Model routing sends each task to an appropriate model: lower-cost options for drafts and summaries, stronger models for difficult reasoning and long documents, and a person for final approval.

OpenRouter docs Codex docs Claude Code

Model routing

OpenRouter

A service for comparing AI models and providers, setting fallback options, and balancing cost, speed, and output quality.

Open OpenRouter docs

Coding agent

Codex

OpenAI's coding agent for working in repositories, editing files, running commands and tests, and reviewing changes.

Open Codex docs

Coding agent

Claude Code

Anthropic's coding agent for exploring codebases, editing files, running commands, and completing longer development tasks.

Open Claude Code docs

AI editor

Cursor

An AI-powered code editor that supports several models, making it a useful comparison with model-specific coding agents.

Open Cursor docs

Company brain

Notion API

A practical place to organize shared company knowledge, databases, permissions, and information that agents can read.

Open Notion developers

Protocol

Model Context Protocol

An open standard for connecting AI applications to tools, data, prompts, and external systems.

Open MCP docs

Creative source

The Creative Act

Rick Rubin's book provides context for the episode's discussion of creative ideas and experimentation.

Open publisher page

Mastery

Mastery

Robert Greene's book supports the closing discussion about mastery, craft, and learning from high performers.

Open publisher page

Learning

The Art of Learning

Josh Waitzkin's learning framework offers a useful follow-up to the discussion of applying mastery across different fields.

Open publisher page

Content strategy

Stripe Press

An example of publishing that educates a market while expressing a company's long-term point of view.

Open Stripe Press

Content strategy

Michelin Guide

A historical example of useful publishing that also encouraged people to travel more and, in turn, buy more tires.

Open Michelin history

Video source

Shoreline Ep. 3

The full episode video behind these notes.

Open YouTube episode

Links & references

EpisodeWatch the full episode — YouTube↗ GuideMap the knowledge your AI needs — sources, permissions, review dates↗ GuideChoose models with a simple rubric — spend by value and risk↗ GuideCompare coding agents on the same job — Codex, Claude Code, Cursor↗ GuideTurn content into company culture — public memory and internal training↗ GuideCreate a quality-review loop — remove filler and unchecked assumptions↗ IndexReturn to the Shoreline resource library — homepage↗

“

Useful AI work depends on four things: the context an agent receives, the incentives behind its use, the model chosen for the task, and the care taken during human review.

Shoreline Ep. 3 · distilled operating principle

Listener checklist

Day 1

Inventory company knowledge.

List the documents, databases, posts, transcripts, and systems an agent would need to understand the company.

Day 2

Set a model policy.

Define which tasks need the most capable model, which can use a lower-cost option, and which require human approval.

Day 3

Compare coding agents.

Give Codex, Claude Code, Cursor, or another tool the same real task. Compare the quality and ease of reviewing the finished work.

Day 4

Move one app workflow.

Complete one repeated workflow through an AI agent, then check whether it saves time without hiding important context.

Day 5

Score ease and value.

Rank ten ideas by ease and value. Complete one idea that scores 3 on both before debating the difficult ones.

Day 6-7

Review for quality.

Check the details, tone, structure, evidence, and fit. Remove generic language and correct unsupported assumptions.