Blog / Why taxonomy is the most effective growth tactic for large catalogue Shopify stores
Why taxonomy is the most effective growth tactic for large catalogue Shopify stores
HOW WE WORK
The most overlooked lever for sustainable growth in a large-catalogue Shopify store isn't ad spend, creative, or a new app. It's the structure of the store itself: taxonomy.
What makes it powerful isn't just that it improves SEO. When the underlying data layer works, every channel that depends on product data starts working at the same time - paid media, internal search, email segmentation, AI discovery. Most stores try to fix these channels one at a time. Fixing the taxonomy fixes them in parallel.
Most brands treat their navigation like a filing cabinet - a place to put things away. But your customers don't want a filing cabinet; they want a personal shopper. When your taxonomy is broken, you aren't just "disorganised" - you're actively hiding products from people who want to buy.
For a large-catalogue Shopify brand, getting taxonomy right is the foundation for growing overall revenue by at least 20% year on year - where the data infrastructure is currently broken. We'll come back to that condition later in the article, because it matters.
1. The world's biggest brands are built on depth of taxonomy
Some of the most successful retail platforms were not built on unique products, but on the superior categorisation of existing ones. Amazon, eBay and Walmart all mastered the same thing: helping people find exactly what they're looking for with minimal friction. They treat taxonomy as a core product feature with dedicated merchandising teams behind it.
The clearest way to see this is the depth they operate at. The number of category levels a customer can drill through is a direct expression of how seriously the platform takes the work of organising its catalogue.
The principles remain the same whether you're a global retailer or a boutique store with 250 products. Over the years, we've worked with brands at every stage of this journey - from those just starting out to teams who were early employees at giants like ASOS and Gymshark. What we've learned is that scale is relative, but the logic is universal: if a customer can't find something in a few clicks, it doesn't exist.
2. The history of taxonomy is a journey from nicknames to standardised data
There is an inherent tension between the human and machine sides of eCommerce, and it's only getting bigger now that agentic commerce is emerging. Taxonomy is the bridge that resolves it.
The most famous example of structured classification is the Linnaean System, the biological hierarchy created by Carl Linnaeus in the mid-1700s. Before Linnaeus, naming a plant or animal was a descriptive mess of nicknames that was impossible to scale. He solved this by creating a structured, machine-readable language for nature - an inverted pyramid that moved from broad kingdoms down to specific species.
The same logic applies in modern eCommerce. We move from a broad department down to a specific SKU, and the hierarchy transforms a flat list of products into a structured database.
If you want a deeper look at the history of how humans have tried to put the world into boxes, this podcast episode covers the topic well.
Why this matters for the future of AI search
This history mirrors the direction of the next decade of search. In traditional SEO, we focused on top-down optimisation - stitching keywords onto pages. AI discovery requires a bottom-up approach. Instead of fixing the page at the end of the line, we're optimising the product data at the source.
Just as Linnaeus replaced nicknames with data, we're moving away from keyword updates toward technical merchandising. AI engines don't just match keywords; they attempt to reason. When an AI agent is asked to find "the best value dog food," it isn't looking for a slogan - it's looking at your ingredients list and price-per-gram to build a justification for its recommendation.
This matters commercially, not just in terms of visibility. Stores already doing the foundational work - clean product feeds, structured attributes, accurate taxonomy - tend to port well to new surfaces. ChatGPT is now building a dedicated ad manager interface for paid placements within ChatGPT responses. The direction of travel across both ChatGPT and Google is toward AI responses that contain a mix of organic results, transactional features, and paid placements. The stores that benefit most when those surfaces open are the ones whose data is already in shape.
3. Fixing your taxonomy has a real impact on revenue
Taxonomy is the logical foundation of your store. But for a business owner or marketing director, the most important question is: what is the actual ROI of being more organised?
To answer it, you have to look at Shopify differently. We treat Shopify as the single source of truth - effectively, the data warehouse for your entire product catalogue. When you optimise your taxonomy within Shopify, you aren't just changing a menu on a website. You're enriching the data that flows downstream to every other channel you use to grow.
Taxonomy is the first domino. When it's poorly structured, it triggers a chain reaction of inefficiencies and lost opportunities. When it's fixed, those same channels start to compound in your favour.
A lack of granularity in Shopify doesn't stay in Shopify. It poisons the well for everything else:
- Paid media and social: Poor data leads to Merchant Center disapprovals, catalogue rejections on Meta, and mismatched price and stock data. The result is ineffective ad targeting and wasted spend.
- SEO: Without clear categorisation, you face crawlability issues and collection bloat, which prevents you from ranking for the specific terms your customers actually use.
- Operations and automation: If your data is messy, you can't automate your workflows. You end up relying on manual tagging and unscalable processes, leading to high operational costs.
- AI search: AI engines rely on structured data to reason. Weak taxonomy leads to hallucinated details or, worse, your products being excluded from AI-generated answers entirely.
The financial formula for growth
The impact of fixing this structure is predictable, with one important condition: it applies when the data infrastructure is genuinely broken. For a store whose taxonomy already works well, the upside from this kind of work is much smaller. Once you've fixed your tracking to give yourself a clean baseline, the two benchmarks we look at are:
- The organic baseline: for a healthy Shopify store, organic revenue should represent roughly 20% of the overall revenue mix.
- The growth ceiling: for a store moving from a flat structure to a deep, granular taxonomy, a doubling of organic revenue is achievable - though only as a floor when the data layer is currently broken, not as a default outcome.
The combined effect, in those circumstances, is roughly a 20% upturn in overall revenue from the improved impact of organic search alone. It's a floor, not a forecast - but it's a floor we've seen consistently when the underlying infrastructure changes from broken to functional.
That number also doesn't account for the reduced wasted spend in PPC, the higher conversion rates from better internal search, the lower bounce rates from improved navigation, or the reduction in manual work for your team. Those gains compound, but they're harder to model in advance.
4. The problem with most taxonomies: data quality and the lack of granularity
If taxonomy is so powerful, why do so many stores get it wrong? In our experience, it usually boils down to two things: poor quality product data and a failure to go deep enough.
Most brands fail to realise that their store structure is only as good as the data feeding it. If your product information is thin - missing materials, specific intents, or technical attributes - you cannot build a granular structure. You're forced to stay shallow because your data doesn't support anything else.
A simple test: do your collections build themselves?
If you want to know whether your taxonomy is working, there's a simpler test than any audit: look at how your collections are built.
If you sit on the right-hand side of that test, the taxonomy work isn't optional - it's the prerequisite for everything else you're trying to do.
The desire path
A useful way of thinking about this is through the lens of a desire path. In urban planning, architects design formal paved roads, but people often ignore them and take the most efficient route instead, wearing dirt trails into the grass. These trails are the physical manifestation of human intent.
Most Shopify stores are built as the rigid, shallow maps. We create broad collections like "Men's Footwear" because that's how we think the world should be organised. But your customers are walking on the grass:
When you don't have the granularity in your product data to pave those paths - material, intent, persona, fit - the customer's search and your store live in different places. Search engines now infer meaning rather than just matching keywords; if your data is brittle, a machine can't reason its way to your products either.
5. Build for the gaps before the data catches up
Where merchants often stall is waiting for "enough data" before they build a new category. We argue for the opposite: build the structure based on logic, even if the products aren't all there yet.
We can learn a lot from the history of the periodic table. Before 1869, chemistry was a disconnected mess of facts. When Dmitri Mendeleev organised the elements, he didn't just list what was already known. He understood the underlying logic so deeply that he famously left gaps in his table - prioritising the structural pattern over the missing data, and predicting the properties of elements like Eka-silicon fifteen years before they were discovered.
| 28.1 Si Silicon |
| ? ? Eka-silicon |
| 118.7 Sn Tin |
When we build a robust taxonomy for a Shopify store, we aim for something similar. By defining clear, granular attributes - material, weight, intent, persona - you create a structure that's bigger than the sum of its parts. A well-structured taxonomy helps search engines (and AI agents) predict the right result even for trait combinations you haven't explicitly named. And by focusing on the logic of how people think, you build a foundation that lasts, rather than constantly reacting to messy, surface-level metrics.
6. Putting this into practice for a large catalogue Shopify store
The principle is one thing, but how do you put it into practice?
The first principle is that Shopify is your point of truth. When you optimise your product data at the source, every downstream channel - from SEO to your Merchant Centre feed to your internal site search - works from the same high-quality foundation. To maintain that point of truth, we follow three rules.
Before going into them, it helps to see what good actually looks like as a structure. The visual below isn't a Blink-named framework, just a useful way of seeing how data should flow through a properly-built store:
1. Stick to the default structure wherever possible
The closer you stay to Shopify's native logic, the easier it is for your data to flow. Shopify has its own Standard Product Taxonomy, and by mapping your items to those global categories, you're using a language that's already machine-readable.
Keeping to the defaults means knowing where your data belongs. This makes sure information feeds correctly into AI platforms and marketing feeds:
- Category structure: use Shopify's native category fields wherever possible to ensure data flows predictably between channels.
- Product Type: always fill this out accurately - it's a primary signal AI platforms use to understand what you sell.
- Metafields vs Tags: as our Head of Delivery, Lauren Harris, sets out in her guide, use metafields for the permanent foundation (the facts of the product) and tags only for merchandising (temporary labels like "Summer Sale"). The most common single failure mode we see is using tags as a substitute for missing metafields.
- GTINs: these belong in the native barcode field - not in a custom metafield, not in a tag, not buried in the description. If a store's GTINs aren't in the barcode field, the governance layer is almost always broken everywhere else too. It's the single fastest way to take a temperature check on whether the data model is being maintained.
2. Map and transform your existing data
Once you've aligned with the default structure, the task becomes a mapping and transformation exercise.
If the data is present but in the wrong place - buried in product descriptions rather than metafields, for example - this is where AI becomes a genuine asset. It can act as a transformation engine, scanning your existing content to extract data and mapping it into the correct fields.
If the data is fundamentally missing, this becomes an operational challenge. We often find that critical information isn't lost - it just isn't being pulled through from a PIM correctly, or it lives exclusively in the heads of your product team. These are specific challenges that need to be addressed before any technical work begins. You can't prompt-engineer your way out of a missing fact; if the information isn't in your business, it won't be in your database.
Part of our process is identifying these gaps and helping you build the rigour needed to capture that data at the source.
3. Customise only where the platform has limitations
We only step away from Shopify's native features when the platform's default logic starts to act as a ceiling. There's one specific area that requires a custom-engineered approach: its flat URL structure.
Shopify was designed for simplicity, which is excellent for smaller catalogues. But it lacks native sub-categories, and this becomes a technical bottleneck as a brand scales. By default, when a customer applies a filter, the URL changes - but the page content stays exactly the same:
Men's Footwear
Search engines see the URL change, but find nothing distinct on the page to index. The high-intent query "men's brown size 10 boots" never gets a page of its own. Because filtered views don't create unique, indexable pages, you're effectively blocked from targeting the specific, granular keywords your customers are actually searching for.
To overcome this, we build the kind of parent-child category structure you'd expect from other platforms. We bridge the structural gap with a few specific technical solutions:
- Secondary navigation and related collections: custom modules that link parent and child categories directly. This is the desire paths idea in practice - paving them for both users and Google. There's a deeper dive in our guide to Related Collections.
- Custom breadcrumb solutions: Shopify doesn't natively track a hierarchical path, so we implement custom breadcrumbs that give search engines a clear map of where a product sits within your taxonomy.
- Custom schema: we use custom schema to explicitly define these relationships in the code, helping AI-driven search engines see the hierarchy that a flat URL structure usually hides.
Final thoughts
Technical limitations aside, the most important shift is a mental one. Taxonomy isn't an administrative chore; it's a discipline. It's the practice of being granular, consistent, and organised enough to respect how your customers actually think.
If you can move from being a digital librarian - filing things away where they belong - to being a technical merchandiser, you unlock a level of growth ad spend alone can't buy. By matching your store's data to the desire paths of human search intent, you make sure your products aren't just on the site, they're actively discoverable. In an era where AI is starting to do the shopping for us, having a structured, logical, deep taxonomy is no longer a nice to have. It's the only way to stay on the map.

