CustomFit.ai โ€” Website personalization, A/B testing and CRO for Shopify and D2C
Product
Features
โœฑ
Website Personalization
Adapt to each visitor's behavior & intent
โง–
A/B & Multivariate Testing
Rigorous experimentation
โœจ
AI CopilotNEW
Personalize with a prompt
๐Ÿค–
AI WingmanNEW
Auto-optimize toward winners
๐ŸŽฏ
AI Conversion OptimizerNEW
GPT-grade test ideas
โœŽ
No-Code Visual Editor
Drag-and-drop edit any element
โ–ฆ
Product Recommendations
Personalized recs that lift AOV
โš‘
Feature Flags
Ship safely with kill-switches
โ—ง
Chrome Extension
Edit your store in the browser
โง‰
Shopify, WooCommerce & more
All platform integrations
View all features โ†’
Use Cases
$
Price A/B Testing
Test price points to maximize revenue
โ–ฆ
Theme A/B Testing
Compare whole layouts & designs
๐Ÿ—‚
Template A/B Testing
Test whole PDP/PLP templates
๐Ÿท
Discount A/B Testing
Find the offer that converts
๐Ÿšš
Shipping A/B Testing
Thresholds, speed & copy
โœ
Content A/B Testing
Copy, images & reviews
๐Ÿ’ณ
Checkout Gateway A/B
Payments & one-click
โŒ–
Geo-Based Personalization
Per-location content & offers
โšก
Buyer-Intent Nudges
Exit-intent & retargeting
โ†”
Split-URL / Redirection
Full-page redirect tests
View all use cases โ†’
Solutions & Guides
โคข
Conversion Rate Optimization
The complete CRO guide
โง–
A/B Testing Software
Buyer's guide for D2C
๐Ÿ›’
Cart Abandonment Recovery
Win back lost carts
๐Ÿ“ฐ
Landing Page Optimization
Convert more paid traffic
S
Shopify A/B Testing
Test your store, no code
S
Shopify Personalization
Tailor the store per shopper
โ—”
First-Time Visitor Offers
Convert new shoppers with trust & offers
โ˜…
Repeat-Customer Experiences
Reward and re-engage loyal buyers
โ—Ž
Campaign-Matched Pages
Match the landing page to the ad
โŒ–
Location-Based Experiences
Currency, language & regional offers
Explore CRO โ†’
Customer stories
GIVA
+32%
conversion via personalized recs
GIVA
Mamaearth
+18%
revenue lift from PDP A/B tests
ME
The Sleep Company
+24%
AOV from product recommendations
TSC
Read customer stories โ†’
Integrations
SWsfGA+15
โœฆ
Not sure where to start?
Let AI Copilot pick your first tests

โ€œWe wake up to evidence-backed tests ready to deploy โ€” not a backlog of maybe ideas.โ€

AN
Anirudh S.
Growth ยท Chargebee
โ˜…โ˜…โ˜…โ˜…โ˜…4.8on G2 ยท 2,400+ brands
Talk to our team โ†’
Widgets
Integrations
Ecommerce & Checkout
Shopify
Shopline
Shoplazza
GoKwik
ShopFlo
Razorpay Magic Checkout
Breeze
Shiprocket
View all integrations โ†’
Analytics & Behavior
Google Analytics 4
Microsoft Clarity
Hotjar
Mixpanel
Amplitude
Heap
Adobe Analytics
Segment (CDP)
View all integrations โ†’
Engagement, CRM & More
Klaviyo
MoEngage
CleverTap
WebEngage
HubSpot
Salesforce
Slack
Meta Ads
View all integrations โ†’
CustomersPricing
Resources
CRO
โ–ค
Playbooks
Proven strategies to boost conversions
๐ŸŽ™
Interviews
D2C leaders & marketing experts
โ–ถ
Webinars
Live deep dives & product sessions
Learn
โœŽ
Blog
Tips, experiments & best practices
๐Ÿ“•
Free E-Books
Mastering personalization
๐Ÿ“–
Conversion Glossary
Every CRO term, defined
โœฆAI CopilotNEWLog inBook a demo
Start free trial
Select your platform โ€” Install in 2 minsWe'll tailor the setup
โšก Risk-free 14-day trial ยท No credit card ยท Cancel anytime
S
Shopify
Install from Shopify App Store
โ€บ
W
WooCommerce
Install the WooCommerce plugin
โ€บ
B
BigCommerce
Install from BigCommerce App Marketplace
โ€บ
SL
Shopline
Install from Shopline App Store
โ€บ
M
Salesforce / Magento
Install from the marketplace
โ€บ
SZ
Shoplazza
Install from Shoplazza App Store
โ€บ
WP
WordPress / Webflow
Install plugin or paste the script
โ€บ
โ—ง
Others
Custom-built on React, Next.js, etc.
โ€บ
Tip: pick your platform โ€” we handle the restBook a demo โ†’
Product
Website PersonalizationA/B & Multivariate TestingAI CopilotAI WingmanAI Conversion OptimizerNo-Code Visual EditorProduct RecommendationsFeature FlagsView all features โ†’
Use Cases
Price A/B TestingTheme A/B TestingTemplate A/B TestingDiscount A/B TestingShipping A/B TestingContent A/B TestingCheckout Gateway A/BGeo-Based PersonalizationBuyer-Intent NudgesSplit-URL / Redirection
Solutions & Guides
Conversion Rate OptimizationA/B Testing SoftwareCart Abandonment RecoveryLanding Page OptimizationShopify A/B TestingShopify Personalization
Explore
WidgetsIntegrationsCustomersPricing
Resources
BlogPlaybooksWebinarsInterviewsE-BooksConversion Glossary
Platforms
ShopifyShoplineShoplazzaChrome ExtensionAll integrations
Start free trialBook a demo
Homeโ€บBlogโ€บexperimentationโ€บHow to Build an A/B Test Hypothesis Library

How to Build an A/B Test Hypothesis Library

SJSapna JoharHead of Growth & CRO, CustomFit.aiJanuary 15, 20258 min read
On this page
  1. Why CRO Programs Stall (And How a Library Fixes It)
  2. What Belongs in a Hypothesis Library Entry
  3. Where Good Hypotheses Come From
  4. Building Your Hypothesis Library: The Practical Process
  5. Hypothesis Library Template
  6. Prioritizing with PIE: A Worked Example
  7. Connecting the Library to Your Testing Program
  8. Key Takeaways
0%
How to Build an A/B Test Hypothesis Library

From the conversion glossary

Concepts referenced in this article, defined.

Definition
What Is Hypothesis? Definition & Guide
Definition
What Is Urgency? Definition & Guide
Definition
What Is Social Proof? Definition & Guide
Definition
What Is Control? Definition, Formula & Guide
Definition
What Is Exit Intent? Definition & Guide
โ† Back to Experimentation guide
Try CustomFit.ai

Run A/B tests and personalize your store without code. 14-day free trial, no credit card.

Start free trial โ†’
Share
XLinkedInEmail

Related articles

experimentation

Testing Velocity: How Many Tests Should You Run?

Sapna Joharยท 8 min read
experimentation

Testing Culture: Getting Buy-In from Leadership

Sapna Joharยท 8 min read
experimentation

Quarterly CRO Review: What to Measure

Sapna Joharยท 8 min read

Start lifting conversions today.

Run rigorous A/B tests and personalize every visit on Shopify or any storefront โ€” no engineers required.

Start free trialBook a demo

Built for every D2C category

๐Ÿงด
Skincare
๐Ÿ’„
Beauty
๐ŸŒฟ
Wellness
โ˜•
F&B
๐Ÿ‘Ÿ
Apparel
๐Ÿ’
Jewelry
๐Ÿ›‹๏ธ
Home
๐Ÿผ
Baby
Live ยท Right now
Mamaearth โ€” free-shipping band +12.4% AOVGIVA โ€” festive collection page +34% revenueBellavita โ€” PDP CTA test +27.4% CVRKapiva โ€” Quiz-driven recs +9.48% CTRThe Sleep Co โ€” landing personalized 2ร— capturesPlum โ€” Returning shopper swap +18.2% CVRMamaearth โ€” free-shipping band +12.4% AOVGIVA โ€” festive collection page +34% revenueBellavita โ€” PDP CTA test +27.4% CVRKapiva โ€” Quiz-driven recs +9.48% CTRThe Sleep Co โ€” landing personalized 2ร— capturesPlum โ€” Returning shopper swap +18.2% CVR
Get in touch

Tell us about your store.

We reply within an hour during business hours. No sales pitch, no spam โ€” just answers from someone who's seen 2,400+ D2C stores.

โœ“ Reply within 1 hourโœ“ No spam, everโœ“ Free demo & setup help
โœ“ Thanks! We'll be in touch shortly.
CustomFit.ai

The all-in-one website personalization, A/B testing & CRO platform for high-growth D2C brands. Made by marketers, fueled by coffee.

in๐•โ—Žโ–ถf
Product
  • Features
  • A/B Testing
  • Personalization
  • AI Copilot
  • AI Wingman
  • AI Conversion Optimizer
  • Feature Flags
  • Widgets
  • Integrations
  • ROI Calculator
Platforms
  • Shopify
  • Shopline
  • Shoplazza
  • Salesforce
  • Chrome Extension
  • All Integrations
Resources
  • Blog
  • Playbooks
  • Webinars
  • GrowthFit Interviews
  • Free E-Books
  • Conversion Glossary
  • Case Studies
Compare
  • vs VWO
  • vs Optimizely
  • vs Google Optimize
  • vs Mutiny
  • vs Intelligems
  • vs Shoplift
  • vs AB Tasty
  • vs Convert
  • vs Kameleoon
Company
  • About Us
  • Partners
  • CustomFit Awards
  • Recognition
  • Contact
  • Privacy Policy
  • Terms & Conditions
ยฉ 2026 CustomFit.ai ยท Valley Monks Pvt Ltd ยท Made by marketers, fueled by coffee, and obsessed with conversions.
SOC 2 Type II ยท GDPR ยท CCPA ยท ISO 27001

An A/B test hypothesis library is a structured backlog of test ideas, each written with evidence, expected impact, and implementation notes โ€” ensuring your CRO program runs continuously rather than stalling between tests. Without a library, CRO programs fail not because of bad tools or low traffic, but because teams spend more time figuring out what to test next than actually testing. A well-maintained hypothesis library can sustain 2โ€“4 tests per week indefinitely.

Why CRO Programs Stall (And How a Library Fixes It)

Most A/B testing programs follow the same arc:

  1. Big initial enthusiasm โ€” test 3โ€“4 ideas the team had been thinking about
  2. A win or two validates the approach
  3. The obvious ideas run out
  4. The team starts asking "what should we test next?" in every meeting
  5. Test velocity drops from 4/month to 1/month
  6. The program is quietly deprioritized

The fix is systematic: build your hypothesis library before you need it. Treat it like a product backlog โ€” continuously filled, regularly prioritized, and always a few steps ahead of your current test.

What Belongs in a Hypothesis Library Entry

A hypothesis is not "test the CTA button color." That's a change, not a hypothesis. A proper hypothesis has four components:

Evidence: What data or research supports this idea? Change: What are you proposing to change? Expected outcome: What metric do you expect to improve, by approximately how much? Audience: Who does this affect?

Template:

"Because [evidence], we believe [change] will cause [outcome] for [audience]."

Bad hypothesis:

"Test the add-to-cart button in orange."

Good hypothesis:

"Because heatmap data shows 60% of mobile users on our PDP don't scroll past the first image, we believe moving the add-to-cart button above the product description will increase mobile add-to-cart rate by 15โ€“20% for first-time visitors."

The difference matters because:

  • The good hypothesis forces you to have evidence before testing
  • It sets an expected outcome you can validate against
  • It specifies an audience for segmented analysis
  • If the test loses, you know which assumption was wrong

Where Good Hypotheses Come From

Source 1: Heatmap and Session Recording Analysis

Tools like Microsoft Clarity (free) or Hotjar show exactly where buyers click, scroll, and drop off. Common findings that generate hypotheses:

  • Many visitors click on product images but not the add-to-cart button
  • Mobile users don't scroll past the fold on PDPs
  • Buyers repeatedly click on non-clickable elements (suggesting navigation confusion)
  • High drop-off at a specific step in the checkout flow

Each finding is a hypothesis waiting to be written.

Source 2: Customer Support Tickets

Mine your last 3โ€“6 months of support tickets. Categorize by theme. The most common themes โ€” size questions, delivery questions, ingredient questions โ€” are direct evidence of information gaps that tests can address.

For example: If 30% of support tickets are size questions, your hypothesis might be: "Because size uncertainty drives 30% of support contacts, adding a size recommendation quiz to PDPs will reduce size-related questions and increase conversion for first-time buyers."

Source 3: Post-Purchase Surveys

A simple 3-question survey after purchase captures: why buyers almost didn't buy, what information they wished they had earlier, and what almost made them buy from a competitor. These are gold for generating high-confidence hypotheses.

Source 4: Exit Intent Surveys

Survey visitors who are about to leave without purchasing. "What stopped you from completing your purchase?" The most common answers become hypotheses.

Source 5: Failed Test Analysis

When a test loses, ask why the hypothesis was wrong. The analysis often generates the next hypothesis. A failed "add urgency badge" test might reveal that buyers respond to social proof instead โ€” generating a new test.

Source 6: Competitor and Industry Research

Review competitor stores and industry case studies. CXL, Baymard Institute, and Nielsen Norman Group publish research on ecommerce UX patterns. A finding from Baymard's checkout research ("43% of US adults have abandoned a checkout due to required account creation") can be validated against your own data to generate a hypothesis.

Building Your Hypothesis Library: The Practical Process

Step 1: Set up a shared document or tracking tool

Use Notion, Airtable, Google Sheets, or a dedicated tool like PlanOut. The format matters less than consistency. Every hypothesis entry should have:

  • Hypothesis statement (using the template above)
  • Evidence source
  • Priority score (PIE or ICE โ€” see below)
  • Estimated effort (hours of dev/design work)
  • Related page/funnel stage
  • Test status (backlog, in design, running, completed)
  • Test result (if completed)

Step 2: Run a hypothesis generation sprint

Set aside 2 hours with your team. Review:

  • Last month's analytics data (high drop-off pages)
  • Recent support tickets
  • Any heatmap/recording observations
  • Post-purchase survey responses

Generate 15โ€“20 hypothesis candidates without filtering. Write rough versions first.

Step 3: Refine and write proper hypotheses

Take the raw ideas and write each as a proper hypothesis using the template. This forces you to find or acknowledge missing evidence.

Step 4: Score and prioritize

Use the PIE framework:

  • P (Potential): How much can this improve things? Score 1โ€“10.
  • I (Importance): How much traffic does the affected page/element see? Score 1โ€“10.
  • E (Ease): How easy is it to implement? Score 1โ€“10.
  • PIE Score: Average of the three. Sort by highest.

Or use ICE:

  • I (Impact): Potential impact on your goal metric
  • C (Confidence): How strong is your evidence?
  • E (Ease): Implementation effort

Step 5: Maintain the library weekly

Assign someone to review the library weekly. New hypotheses should be added as evidence emerges. Completed tests should be documented with results. The library should never empty โ€” when you're running 4โ€“6 tests/week, you need 4โ€“6 new ideas per week coming in.

Hypothesis Library Template

Use this structure for each entry:

HYPOTHESIS #[number]
Status: [Backlog / In Design / Running / Complete]
Priority Score: [PIE/ICE score]

Hypothesis Statement:
Because [evidence], we believe [change] will cause [outcome] for [audience].

Evidence Sources:
- [Source 1: e.g., Heatmap data showing 55% mobile drop-off at image scroll]
- [Source 2: e.g., Support ticket analysis โ€” 20% of tickets are size questions]

Change Description:
- Control: [What exists today]
- Variant: [What you'll test]

Success Metric: [Primary KPI, e.g., add-to-cart rate]
Secondary Metrics: [e.g., CVR, session duration]

Estimated Traffic Required: [From sample size calculator]
Estimated Implementation Time: [Hours]

Test Results (after completion):
- Duration:
- Winner/Loser/Inconclusive:
- Result magnitude:
- Learnings:

Prioritizing with PIE: A Worked Example

Suppose you have three hypotheses for your Shopify PDP:

Hypothesis A: Add size guide tooltip near size selector

  • Potential: 8 (size anxiety is a major drop-off driver)
  • Importance: 9 (all PDP visitors see the size selector)
  • Ease: 8 (tooltip is low implementation effort)
  • PIE Score: 8.3

Hypothesis B: Add video testimonials below the fold

  • Potential: 6 (social proof helps but buyers may not scroll)
  • Importance: 5 (only buyers who scroll see it)
  • Ease: 4 (video production required)
  • PIE Score: 5.0

Hypothesis C: Reorder PDP sections: put ingredients before product description

  • Potential: 7 (supplements buyers specifically care about ingredients)
  • Importance: 7 (all PDP visitors affected)
  • Ease: 9 (just reordering existing content)
  • PIE Score: 7.7

Test order: A, then C, then B.

Connecting the Library to Your Testing Program

Your hypothesis library connects to your testing roadmap. Each sprint (typically 2โ€“4 weeks), you pull the top-scoring hypotheses from the library, design the test, implement it via CustomFit.ai or your chosen platform, and run it.

The test results feed back into the library:

  • Winners generate follow-up hypotheses ("what else can we optimize on this page now that we've improved the CTA?")
  • Losers generate diagnostic hypotheses ("the urgency badge didn't work โ€” was it because urgency isn't a barrier, or because buyers didn't trust the countdown timer?")

A mature hypothesis library becomes a record of your brand's conversion intelligence โ€” every test, every result, every learning documented in one place.

Key Takeaways

  • A hypothesis library prevents CRO programs from stalling between tests โ€” treat it like a product backlog
  • Write hypotheses with four components: evidence, change, expected outcome, and audience
  • Generate hypotheses from heatmaps, support tickets, surveys, failed test analysis, and competitor research
  • Use PIE or ICE scoring to prioritize which hypotheses to test first
  • Maintain the library weekly โ€” it should always have 4โ€“6 weeks of ready-to-test ideas
  • Document test results in the library; failed tests are as valuable as winners if properly analyzed