Software Development Tools in 2026: What a CTO Actually Uses

    Matt Watson
    By Matt Watson · CEO of Full Scale, 4x Founder, Author of Product Driven
    Updated 13 min read

    The toolchain I used as a CTO five years ago barely resembles the one my teams use today. Five years ago we were arguing about IDEs and whether to standardize on Jira. Today my engineers spin up Cursor with Claude Code, wire it into a GitHub Actions pipeline, and ship code that would have taken a sprint in an afternoon. The category shift is real, and it’s still happening.

    I’m in a slightly unusual seat to write this. I founded Stackify, which built two of the developer tools the 2021 version of this post recommended: Prefix and Retrace. I’ve also been the default DBA, the backline debugger, and the guy paged at 2 a.m. at three companies, so I have opinions about what makes a dev tool worth the seat license and what makes it shelfware. I’m Matt Watson, 4x founder, and the CEO of Full Scale, where we hire and retain engineers in the Philippines for US software companies. The team here uses most of the tools below every day. Many we threw out the first year.

    This is what the 2026 software development toolchain actually looks like, category by category, with the honest take on what’s worth standardizing on and what’s still hype.

    What actually changed: AI rewrote the stack

    The thing that changed everything in this category is AI, and it happened faster than anyone running engineering predicted. The 2024 Stack Overflow Developer Survey found 76% of developers either using or planning to use AI tools, up from 70% the year before. The number has only kept climbing. Inside Full Scale, the share of engineers who use an AI coding assistant daily is well above 90%, and we’ve started treating “comfortable in a modern AI workflow” as a hiring criterion alongside the language stack.

    That single shift cascades into every other category on this page. The IDE became the AI IDE. The pull request became an AI-reviewed pull request. The test runner became the AI test generator. Even task management is starting to fold in AI summarization and ticket drafting. So the rest of this guide treats AI as one of nine categories, but it’s also the gravity well bending the other eight.

    A second, quieter shift: the toolchain has consolidated. Five years ago you’d hear teams arguing about which APM to pick from a list of fifteen. Today the practical shortlist is four. Same story in testing, in CI/CD, in version control. The leaders pulled ahead. That’s a gift to engineering leaders who don’t want to spend their weekend evaluating vendors.

    AI coding assistants and agents

    This is the category that did not exist on a list like this five years ago, and now it’s the most important one. The tools fall into two flavors: assistants that sit inside your editor and help you write code, and agents that take a task and run with it on their own.

    On the assistant side:

    GitHub Copilot is still the most widely adopted because of the GitHub integration and the enterprise rollout story. It’s a safe choice for any team already deep in the GitHub ecosystem.

    Cursor and Windsurf are AI-native forks of VS Code that treat the assistant as the primary interface, not a sidebar. Cursor is what most of our Full Scale engineers reach for when they want to work fast in a familiar editor. The composer feature alone changed how I think about scaffolding new modules.

    Claude Code is Anthropic’s command-line coding agent. It lives in your terminal, has access to your repo, and can read, edit, run, and iterate. It’s the tool I personally use the most. For a CTO who still ships code, the CLI-first ergonomics matter.

    On the agent side:

    Devin is the most aggressive bet on a fully autonomous coding agent. You hand it a ticket, it tries to ship the PR. Useful for the right shape of task, not yet ready for unsupervised work on a production codebase.

    Cline and the Copilot Coding Agent represent a middle ground: agentic capability that runs against your codebase but with the developer in the loop. These are where the action is in 2026, and where most teams will land.

    The honest take: AI coding tools do not replace senior developers, they raise the floor of what every developer can produce. That is the real story of AI pair programming: it rewards the developers who understand the code and exposes the ones who do not. The juniors on our team ship work that looked senior in 2022. The seniors are doing system design and review at a much higher rate than they’re writing lines. That’s a structural change. I wrote about how AI is reshaping engineering teams in more depth, including the management implications.

    IDEs and code editors

    The IDE has become a delivery mechanism for everything else on this list. Pick the wrong one and your team loses access to extensions, AI integrations, and the entire plugin ecosystem.

    Visual Studio Code is the dominant editor. It’s free, the extension library is the deepest in the category, and almost every AI tool above ships a VS Code extension first. If you don’t have a strong reason to use something else, this is the default.

    JetBrains IDEs (IntelliJ for Java, PyCharm for Python, WebStorm for JavaScript, Rider for .NET) are the alternative. They’re paid, they’re heavier, and they’re the best at deep language understanding for their target languages. Java and Kotlin teams still default here.

    Cursor and Windsurf are the AI-native forks I covered above. They’re VS Code under the hood, but the AI workflow is the headline feature.

    Microsoft Visual Studio (the full IDE, not VS Code) is still alive for Windows-heavy .NET teams. It’s powerful but it’s also the heaviest tool in this category and increasingly redundant for teams that have moved to .NET on macOS or Linux. If you’re starting fresh on .NET in 2026, look at VS Code with the C# Dev Kit first.

    If you’re standardizing across a new team, my recommendation is: VS Code (or Cursor) as the base, JetBrains for any team that needs deep refactoring on a complex Java/Kotlin/Scala codebase. We covered the broader code editor landscape elsewhere on this blog.

    Version control and code review

    This category has been settled for years on the version control side, but the code review side just got reinvented.

    GitHub is the default. It owns enterprise hosting, it owns most of the open-source world, and it now ships AI code review as a first-class feature through Copilot. The 2024 Octoverse report shows GitHub crossed 150 million developers, which makes it the de facto industry standard.

    GitLab is the alternative for teams that need a single platform that bundles version control, CI/CD, and security scanning. It’s also the right pick if your stack involves self-hosting or you want to avoid Microsoft for procurement or policy reasons.

    Bitbucket still exists, primarily inside Atlassian shops. Beyond a Jira-tight integration, there is no strong reason to pick it for a new team in 2026.

    The interesting change is on the code review side. AI tools are starting to do the first pass on every PR before a human looks at it. Greptile and CodeRabbit are the leaders here, with GitHub’s own Copilot code review baked into the platform. The shift is meaningful: a senior reviewing a PR in 2026 is reviewing a PR that has already been linted, security-scanned, style-checked, and AI-reviewed once.

    For deeper background on the version control side, our version control systems guide is a good read.

    Application performance monitoring

    I’ll caveat this section: I co-founded Stackify in 2012 to build APM tooling. Prefix (the local dev profiler) and Retrace (the cloud APM) were both Stackify products. I sold Stackify and the products still exist, and I have strong opinions about what makes this category work.

    The honest 2026 APM shortlist is short:

    Datadog is the category leader. It does APM, logs, metrics, traces, RUM, and security in one platform. It’s expensive at scale. It is also the safest pick for a team that needs everything wired together and doesn’t want to integrate five separate vendors. Most of our larger Full Scale clients are on Datadog.

    New Relic is the long-running alternative. It re-priced to a usage-based model a few years back, which made it competitive again. The technology is solid. The brand recognition has slipped vs. Datadog.

    Sentry is not strictly APM, it’s error monitoring with performance bolted on. For 80% of teams below a certain scale, Sentry plus log aggregation is the right answer. It’s where I’d start a new SaaS company today.

    Stackify Retrace is still alive under Netreo. If you’re a .NET shop, it’s a category-specific pick that’s cheaper than Datadog. We use it on some Full Scale projects where the cost-to-coverage ratio makes sense.

    The lesson I took from building Prefix is that the tool the developer actually opens during local dev is worth more than the dashboard nobody checks until prod is on fire. Whatever you pick, make sure the local dev experience is good. If your engineers only see APM data when production breaks, you’ve bought a fire alarm, not a monitoring tool.

    Debugging and profiling

    Debugging is the category most people forget to standardize, and then half the team is fighting different log formats and tracing strategies six months later.

    Chrome DevTools is the default for anything front-end. You get the network panel, the performance profiler, memory snapshots, and source maps all in the browser, and every front-end developer should be fluent in it.

    Language-native debuggers (pdb for Python, the Node inspector, the .NET debugger, gdb for C/C++) still matter and most engineers still underuse them. The breakpoint and the watch window beat console.log for any non-trivial bug. We’ve written about the art of debugging at greater depth, including the discipline side that no tool fixes.

    Building a development team?

    See how Full Scale can help you hire senior engineers in days, not months.

    AI-assisted debugging is the new wrinkle. Pasting a stack trace into Claude or Copilot and asking it to triage the root cause is now a normal part of the loop. It is much faster than scanning forums for the same error message. The risk is shallow understanding, which compounds over time. Treat the AI as a co-debugger, not a replacement for actually reading the code.

    Testing and QA

    The testing category got rebuilt twice: once when web apps replaced desktop, and again when Playwright replaced Selenium as the default end-to-end framework.

    End-to-end:

    Playwright is the new default for browser automation. Microsoft built it, the API is dramatically cleaner than Selenium’s, and the auto-waiting alone is worth the migration cost for any team still on Selenium. If you’re starting fresh in 2026, this is the pick.

    Cypress is the runner-up and still strong for teams that want a more all-in-one dev experience. It has weaker cross-browser support than Playwright but a friendlier debugging UX.

    Selenium still exists. If you have a working Selenium suite and no good reason to migrate, leave it. If you’re starting fresh, pick Playwright.

    Unit and component testing:

    Vitest is replacing Jest for new JavaScript and TypeScript projects. It’s faster, it works with modern bundlers, and the Jest-compatible API makes migration painless.

    Jest is still the incumbent. If you have a healthy Jest suite, do not migrate just because.

    AI test generation:

    Tools like Testery, Codium, and Qodo are the new layer. They generate tests from code, fix flaky tests, and assist with coverage. The 2026 state of the art is humans writing the architecture-level tests and AI generating the coverage tests in between. Testery is a Full Scale client; we’ve worked closely with their team on the engineering side, and you can read the Testery case study for the build story.

    For wider context, our roundups of automated testing tools and software testing tools for development teams go deeper into category-by-category picks.

    CI/CD and deployment

    If you don’t have a CI/CD pipeline in 2026, you’re not running a modern engineering team. The good news is that the category has consolidated to the point where the picks almost choose themselves.

    GitHub Actions is the default. It runs in the place your code already lives, the marketplace has thousands of pre-built actions, and pricing is reasonable for most team sizes.

    GitLab CI/CD is the alternative for GitLab shops. Functionally on par with Actions for most teams.

    CircleCI and Jenkins still exist. Jenkins is the legacy default that older enterprise teams are slowly migrating off. CircleCI is the SaaS option for teams that want CI separate from version control. Both are losing share to GitHub Actions.

    On the deployment side:

    Vercel is the default for Next.js and most modern front-end frameworks. Full Scale’s own website runs on Vercel. The DX is the best in the category.

    Render, Fly.io, and Railway are the three-way race for “the new Heroku.” All three are good. Render is the safest pick for most teams. Fly is the right pick if you need global edge deployment.

    Docker is still the container layer underneath everything. Kubernetes is still the orchestration layer for teams that genuinely need it. Most teams don’t. If you’re under fifty engineers and you’re running Kubernetes, ask yourself why.

    Our CI/CD pipeline automation guide and the deployment automation playbook cover the practical setup work.

    Task and project management

    This is the category where the right answer depends most on team size and culture, and where the wrong choice quietly slows a team down for years.

    Linear is the modern winner for startups and small-to-midsize engineering teams. It’s fast, the keyboard shortcuts are well thought out, and the data model fits how engineers actually work. If I were starting a company today, this is the pick.

    Jira is the enterprise default. It’s powerful and deeply customizable, which is both why large companies pick it and why it feels slow once they’re in it. Most large companies use it because they already use it. If you’re switching, pick Linear or stay on Jira; don’t go to a third option.

    GitHub Projects is the right pick for small, engineering-heavy teams that already live in GitHub and want to avoid a second tool. It’s improved a lot in the last two years.

    Notion is not a task manager, but plenty of teams use it as one. For docs plus light project management it’s excellent. As a primary ticket system at scale, it falls over.

    Asana, Monday.com, and Trello are still here. They’re better suited to cross-functional teams (marketing, ops) than to engineering. Engineering teams keep ending up on Linear, Jira, or GitHub Projects.

    We’ve written about the scrum vs. kanban decision and broader agile process choices in case the tool choice surfaces process questions you haven’t answered yet.

    How to pick tools as an engineering leader

    A few rules I’ve learned the hard way running engineering at three companies and watching client teams at Full Scale make and unmake these decisions.

    Standardize, don’t fragment. If two engineers want two different IDEs, fine. If two engineers want two different APMs, you have a problem. The plumbing layer (version control, CI/CD, APM, testing framework) needs to be one choice. The personal-productivity layer (editor, AI assistant) can vary.

    Don’t chase shiny. A new AI coding tool launches every two weeks. Your team cannot adopt all of them. Pick a base layer, give it six months, and re-evaluate. Constant tool-churn is more expensive than the marginal productivity gain from each new tool.

    Treat data leakage like a real risk. AI coding tools send code to a third party. Some of those third parties retain and train on the code. For most teams this is fine. For teams with regulated data, IP-sensitive code, or government contracts, it’s a real problem. Read the data handling policy on every AI tool before rollout. The privacy controls in Tabnine, Cursor’s enterprise mode, and Copilot’s business tier exist for a reason.

    Budget for tools the way you budget for headcount. A modern engineer’s tool stack is somewhere between $100 and $500 per month per seat. That’s real money at scale, and it’s still a bargain compared to losing a sprint to a bad debugger. Don’t be cheap on the wrong line item.

    The tool catalog is not the strategy. A toolchain list is a starting point. What ships software is people, process, and ownership. I wrote a book about this called Product Driven, which goes deeper into how engineering leaders should think about ownership and shipping. The toolchain is the bottom of the stack. Culture is the top.

    The toolchain alone won’t fix a broken team

    Every tool on this page amplifies whatever culture and process you already have. A great team with a mediocre toolchain ships software. A broken team with the best toolchain on the market ships broken software faster. Tools are leverage, not strategy.

    The other thing tools won’t fix: not enough engineers. Most of the engineering leaders I talk to are short on headcount, behind on hiring, and burning out the team they have. The toolchain helps at the margin. The real lever is having enough capable engineers in the right seats. That’s what we do at Full Scale, four years running on the Inc. 5000: we recruit, vet, and retain senior engineers in the Philippines who plug into US teams as full-time team members, not as freelancers or contractors on a project deal.

    If you’re trying to modernize your toolchain and also fill out the team that will use it, book a call with Full Scale or learn more about how we help companies hire developers. The right tools and the right people compound. Either one alone won’t get you there.

    Get Product-Driven Insights

    Weekly insights on building better software teams, scaling products, and the future of offshore development.

    Subscribe on Substack

    Ready to add senior engineers to your team?

    Have questions about how our dedicated engineers can accelerate your roadmap? Book a 15-minute call to discuss your technical needs.