After spending real time with Codex in day-to-day work, I now have a clear view of where it creates leverage and where engineering discipline still has to stay human-led.
Using Codex App feels like pairing with a silent teammate: fast when I am moving fast, patient when I need to think, and quiet when I just want to execute. It helps me think more clearly, ship faster, and stay in flow longer without trying to take the wheel.
What Is Working Well
- Thread organization: Work stays separated by context, so switching between initiatives does not become a giant context blob.
- Speed with coherence:
codex.5-3is fast, but the real win is that it follows the active thread while moving quickly. - Reasoning-level control: The depth dial is practical: deep reasoning when needed, direct execution when speed matters.
- Context handling: Long sessions usually stay tidy and coherent with less manual clean-up.
- Git integration: Useful for momentum and visibility, while terminal-based commit discipline remains intact.
VS Code + Terminal Stay the Control Layer
Codex has not replaced my editor or shell workflow, and I would not want it to. I still review diffs in VIM or VS Code, validate behavior myself, and protect a clean commit history from the terminal.
What changed is the distance between idea and implementation. That stretch is now much shorter, while quality gates and judgment still live in the same place they always did.
Speed Is Good, Architecture Is the Multiplier
Building the wrong structure quickly just gets you to problems faster. Long-term velocity still comes from architecture decisions: system boundaries, ownership, contracts, data flow, failure handling, observability, and testability.
Codex can accelerate implementation, but sustainable delivery still depends on making solid product and architectural calls early and revisiting them with discipline as systems evolve.
Test-First Has Been a Natural Fit
One pattern that seems to work especially well for me is "parallel" test-first building with "Codex in the loop". It's sort of like doing BDD/TDD in parallel for the same change (think parallel reds) followed by validation, followed by feedback until it's good, followed by implementing production code. More about that in a separate blog post.
This may be a redefined, parallel AI-modern XP approach: short feedback loops, frequent validation, and confidence without heroics later. I am still maturing this idea.
Useful Beyond Coding
I have also tested Codex as a sounding board for new business ideas, structured a course, summarized transcripts, and turned half-formed thinking into clear next actions. That points to a genuinely general-purpose model, not just a code assistant.
Automations and Skills
Automations are still potentially very useful, yet I need to explore more. My favorite so far is a dependency sweep that repeatedly catches outdated packages and keeps projects healthy with low operational friction.
Pre-built skills look promising as well. I have not explored them deeply enough yet, but the early signal is strong and worth more reps.
Context and Scope
So far, my usage has been local mode only. This write-up is strictly about workflow and developer experience, and not a statement for or against individual OpenAI usage choices.
Final Take
Most of my usage to date has been on private pet projects, but the leverage is clear enough that I can already see myself teaching teams how to use this approach while keeping engineering standards high.
I came in curious. Now I am properly in.
Building an AI-first engineering team? I write about architecture, XP, and delivery practices that keep speed and quality aligned. Connect with me on LinkedIn.