Dataset Verification

Dataset verification is about more than validity checks — it’s about understanding your contacts in context. ApexVerify’s targeting parameters let you tell the system who you’re targeting and why, enabling more accurate scoring and better-quality results.


When to use dataset verification

CRM hygiene

Periodically verify your CRM contacts to remove invalid records before they cause deliverability issues, wasted ad spend, or inflated pipeline numbers.

Marketing list cleaning

Clean subscriber lists before sending campaigns. Reduce bounce rates, protect your sender reputation, and ensure your data complies with email marketing regulations.

Lead generation validation

Verify incoming leads from forms, ad platforms, or data providers before they enter your CRM. Stop paying to enrich or engage bad data.

Data provider enrichment

Validate data purchased from third-party providers. Understand exactly what percentage is deliverable before committing to a full integration.


Dataset preparation checklist

1. Choose the correct verification type

Decide whether you’re verifying emails or phone numbers. Each batch handles one type. If you have both, create two separate batches.

2. Normalize your data

Before exporting:

  • Trim whitespace from all values
  • Lowercase all email addresses
  • Format phone numbers in E.164 (+[country code][number])
  • Remove empty rows
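
The normalization steps above can be sketched in a few lines of Python. This is a minimal illustration, not ApexVerify tooling: the function names are hypothetical, and the phone handling assumes that any number without a leading "+" is a national number for a single default country code. Real phone parsing needs country-specific rules (trunk prefixes, extensions), so treat that part as a starting point only.

```python
import re


def normalize_email(value: str) -> str:
    """Trim surrounding whitespace and lowercase the address."""
    return value.strip().lower()


def normalize_phone(value: str, default_country_code: str = "1") -> str:
    """Best-effort E.164 formatting: '+' followed by digits only.

    Assumption: a number without a leading '+' is a national number for
    default_country_code. Trunk prefixes such as a leading 0 are NOT handled.
    """
    value = value.strip()
    digits = re.sub(r"\D", "", value)  # drop spaces, dashes, parentheses
    if not digits:
        return ""
    if value.startswith("+"):
        return "+" + digits
    return "+" + default_country_code + digits


def normalize_rows(rows, kind: str):
    """Normalize every row for the given batch type and drop empty results."""
    clean = normalize_email if kind == "email" else normalize_phone
    return [v for v in (clean(r) for r in rows) if v]
```

For example, `normalize_rows(["  Alice@Example.COM ", ""], "email")` keeps a single cleaned address and discards the empty row.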

3. Export as a TXT file

Export your cleaned data as a plain .txt file — one entry per line, UTF-8 encoded. Most CRMs and spreadsheet tools can export a single column directly.
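
If you are exporting from a script rather than a CRM or spreadsheet, the file format described above is simple to produce. The helper below is a hypothetical sketch (the name `export_txt` is not part of any ApexVerify SDK) that writes one entry per line in UTF-8 and skips empty values.

```python
from pathlib import Path


def export_txt(values, path) -> int:
    """Write one non-empty entry per line, UTF-8 encoded; return the line count."""
    lines = [v for v in values if v]
    text = "".join(line + "\n" for line in lines)
    Path(path).write_text(text, encoding="utf-8")
    return len(lines)
```

Because each batch handles one type, call it once per batch, for example once for the email column and once for the phone column.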

4. Set targeting parameters

Apart from type and target_country, targeting parameters are optional, but they give the verification engine important context. The more accurately you describe your audience, the better the scoring signal.


Targeting parameters explained

Targeting parameters are set via PUT /v1/batch/{uuid} after upload. Only type and target_country are required.

type

| Value | Description |
| --- | --- |
| email | Verify email addresses |
| phone | Verify phone numbers |

target_country

An ISO 3166-1 alpha-2 country code representing the country your audience is based in. Examples: US, GB, SG, AU, DE, FR.

This helps the engine apply country-specific routing rules, carrier lookups, and domain heuristics.
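
As an illustration of setting the two required parameters, the sketch below builds (but does not send) a PUT request for /v1/batch/{uuid} using only the Python standard library. Several details are assumptions that this guide does not confirm: the base URL is a placeholder, the body is assumed to be JSON, and the bearer-token header is a guess at the authentication scheme. Check the API Reference before relying on any of them.

```python
import json
import urllib.request

# Assumption: placeholder host; substitute the real API base URL.
BASE_URL = "https://api.example.com"


def build_targeting_request(batch_uuid, verification_type, target_country, api_key):
    """Build a PUT /v1/batch/{uuid} request carrying the two required parameters."""
    if verification_type not in ("email", "phone"):
        raise ValueError("type must be 'email' or 'phone'")
    if len(target_country) != 2 or not target_country.isalpha():
        raise ValueError("target_country must be a two-letter ISO 3166-1 alpha-2 code")
    payload = {"type": verification_type, "target_country": target_country.upper()}
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/batch/{batch_uuid}",
        data=json.dumps(payload).encode("utf-8"),  # assumption: JSON request body
        method="PUT",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumption: bearer-token auth
        },
    )
```

Passing the returned object to `urllib.request.urlopen` would send it; building the request separately makes it easy to inspect the URL and body first.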

Describe the professional role of your target audience. Pass the corresponding integer value:

| Code | Audience |
| --- | --- |
| 1 | Agency / Consultants |
| 2 | Creators & Influencers |
| 3 | Customer Support & Success |
| 4 | Developers & Technical Leads |
| 5 | E-commerce & Retail Managers |
| 6 | Enterprise Decision-Makers |
| 7 | Finance & Operations Professionals |
| 8 | Founders & C-Level Executives |
| 9 | Freelancers & Solopreneurs |
| 10 | Government & Public Sector Employees |
| 11 | HR & Talent Acquisition |
| 12 | Healthcare Professionals |
| 13 | Hobbyists & Enthusiasts |
| 14 | Investors & VCs |
| 15 | Job Seekers |
| 16 | Legal Professionals |
| 17 | Marketing & Sales Professionals |
| 18 | Non-Profit Leaders & Staff |
| 19 | Parents & Families |
| 20 | Product & Project Managers |
| 21 | Real Estate Professionals |
| 22 | Small-to-Medium Business (SMB) Owners |
| 23 | Students & Educators |

The industry of your target market. Pass the corresponding integer:

| Code | Industry |
| --- | --- |
| 1 | Accommodation |
| 5 | Agriculture, Forestry, Fishing and Hunting |
| 15 | Chemical Manufacturing |
| 17 | Computer and Electronic Product Manufacturing |
| 18 | Construction |
| 23 | Data Processing, Hosting, and Related Services |
| 25 | Educational Services |
| 29 | Finance and Insurance |
| 42 | Health Care and Social Assistance |
| 46 | Information |
| 54 | Manufacturing |
| 83 | Professional, Scientific, and Technical Services |
| 86 | Real Estate |
| 91 | Retail Trade |
| 101 | Telecommunications |
| 109 | Utilities |

The full list of 115 industry codes is available in the API Reference.

The goal of the outreach campaign for which this data will be used:

| Code | Objective |
| --- | --- |
| 1 | App Installs / Downloads |
| 2 | Audience Building (Email/Community) |
| 3 | Brand Awareness & Recall |
| 4 | Customer Acquisition / Sales |
| 5 | Free Trial or Freemium Signups |
| 6 | Lead Generation (Top/Mid-Funnel) |
| 7 | Loyalty & Referral Program Growth |
| 8 | Market & Customer Research |
| 9 | New Market Entry |
| 10 | Partner & Affiliate Recruitment |
| 11 | Product Engagement & Feature Adoption |
| 12 | Sales-Ready Leads (Bottom-Funnel) |
| 13 | Upsell / Cross-sell Revenue |
| 14 | User Retention & Churn Reduction |
| 15 | Waitlist / Pre-order Signups |
| 16 | Website / App Traffic Acquisition |
16Website / App Traffic Acquisition

Control how aggressively the system reuses prior verification results.

| Parameter | Default | Description |
| --- | --- | --- |
| use_account_cache | true | Reuse your own previous results |
| max_account_cache_backoff | 30 days | How far back to look in your account cache (1–180 days) |
| use_global_cache | true | Reuse anonymized results from other accounts |
| max_global_cache_backoff | 30 days | How far back to look in the global cache (1–180 days) |

Both caches deliver a 50% credit refund for cache-hit records. Account cache is checked first.
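
To see what the refund means for a batch, here is a small worked calculation. It assumes one credit per verified record and that the 50% refund applies per cache-hit record; neither pricing detail is stated in this guide, so both are exposed as parameters. The backoff check simply enforces the 1–180 day range from the table.

```python
def net_credits(total_records, cache_hits, credits_per_record=1.0, refund_rate=0.5):
    """Credits consumed after refunding part of the cost of cache-hit records.

    Assumptions: a flat per-record price and a refund applied per cache hit.
    """
    if not 0 <= cache_hits <= total_records:
        raise ValueError("cache_hits must be between 0 and total_records")
    charged = total_records * credits_per_record
    refund = cache_hits * credits_per_record * refund_rate
    return charged - refund


def validate_backoff(days: int) -> int:
    """Check a cache backoff window against the documented 1-180 day range."""
    if not 1 <= days <= 180:
        raise ValueError("backoff must be between 1 and 180 days")
    return days
```

Under those assumptions, a 10,000-record batch with 2,000 cache hits costs 10,000 minus 1,000 refunded credits, so `net_credits(10000, 2000)` returns `9000.0`.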


Email vs. phone datasets

What email verification checks

  • Syntax — RFC 5321/5322 compliance
  • Domain — MX record existence and validity
  • SMTP — Mailbox existence probe
  • Provider flags — Free provider (Gmail, Yahoo…), role address (info@, admin@…), disposable, catch-all

Key result fields

| Field | What it means |
| --- | --- |
| quality: "good" | Safe to send: deliverable and not flagged |
| quality: "risky" | May deliver but has risk signals (catch-all, role, free) |
| quality: "bad" | Do not send: invalid or known bad |
| is_catch_all: true | Domain accepts all addresses, so individual deliverability is unknown |
| is_disposable: true | One-time inbox service, likely low engagement |
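
One way to act on these fields is to sort each result into a send, review, or drop bucket. The sketch below assumes each result arrives as a dictionary containing the field names shown in the table; the record shape and the triage policy itself are illustrative assumptions, so adjust the rules to your own risk tolerance.

```python
def triage(record: dict) -> str:
    """Map one verification result to 'send', 'review', or 'drop'.

    Assumption: the result is a dict with 'quality', 'is_catch_all',
    and 'is_disposable' keys; missing flags are treated as False.
    """
    quality = record.get("quality")
    if quality == "bad":
        return "drop"
    flagged = bool(record.get("is_catch_all")) or bool(record.get("is_disposable"))
    if quality == "good" and not flagged:
        return "send"
    # Risky, flagged, or unrecognized results go to manual review.
    return "review"


def split_results(records):
    """Group records by triage bucket."""
    buckets = {"send": [], "review": [], "drop": []}
    for record in records:
        buckets[triage(record)].append(record)
    return buckets
```

Routing unrecognized quality values to review rather than send keeps the policy conservative if new values appear in results.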