Guides

Archiving Dark Web Marketplace Pages for Analyst Research

How OSINT researchers archive dark web marketplace listings with timestamps, folder structures, and structured export using PageStash in Tor Browser.

P
PageStash Team
April 11, 2026
9

Archiving Dark Web Marketplace Pages for Analyst Research

Dark web marketplaces are among the most volatile sources an OSINT analyst encounters. Entire platforms disappear overnight—exit scams, law enforcement takedowns, or operators simply pulling the plug. Individual listings rotate even faster. If you're documenting vendor activity, pricing trends, product descriptions, or communication channels for research purposes, you need an archival workflow that captures content the moment you see it.

This guide walks through a structured approach to archiving dark web marketplace pages using PageStash's Firefox extension in Tor Browser, with a focus on case organization, timestamped evidence, and structured data export.

The Archival Problem with Dark Web Marketplaces

Traditional web archival doesn't work here:

  • No crawl access—marketplace pages sit behind Tor, often behind authentication walls
  • No URL permanence—.onion addresses change, mirrors rotate, pages get pulled
  • Content is time-sensitive—a listing that exists today may be gone in hours
  • Screenshots alone aren't enough—you need searchable text, extracted entities, and metadata

Analysts who rely on bookmarks or manual screenshots end up with disorganized folders of PNGs with no searchability, no timestamps tied to content, and no way to connect observations across captures.

Setting Up Your Archival Workspace

Folder Structure for Marketplace Research

A disciplined folder structure prevents chaos as captures accumulate. Here's a proven layout:

  • Top level: Investigation name (e.g., "DarkMarket Monitoring 2026")
    • Marketplaces (one sub-folder per platform)
      • Vendor Profiles — captured seller pages with reputation data
      • Product Listings — individual item pages with descriptions and pricing
      • Forum/Community — discussion threads, dispute resolutions, announcements
      • Infrastructure — login pages, mirrors, PGP-signed canaries

Tagging Strategy

Apply tags that enable cross-marketplace analysis:

  • By substance/category: pharmaceuticals, fraud-tools, counterfeit, malware
  • By vendor alias: tag with the vendor's handle for quick filtering
  • By currency: bitcoin, monero, ethereum
  • By data type: pricing, vendor-comms, pgp-key, reviews

The Capture Workflow

Step 1: Navigate and Assess

In Tor Browser, navigate to the target marketplace page. Before capturing, assess what's on the page:

  • Is this a listing page with pricing, product description, and vendor info?
  • Is this a vendor profile with reputation scores, PGP keys, and contact methods?
  • Is this a forum thread with communication between actors?

This determines which folder and tags you'll apply.

Step 2: Capture with PageStash

Click the PageStash extension icon. The capture grabs three layers of data:

  • Full-page screenshot — visual proof of exactly what the page looked like at capture time
  • HTML source — the underlying page structure, useful for identifying hidden elements or metadata
  • Extracted text — all visible text content, made searchable across your workspace

The timestamp is recorded automatically. This is critical for marketplace research—you need to establish when a listing existed, not just that it existed.

Step 3: Organize Immediately

File the capture into the correct folder and apply tags before moving on. Backlog organization is a debt that compounds fast. With 50 untagged captures, you'll spend more time organizing than you saved by skipping it.

Step 4: Review Entity Extraction

After capture, check what PageStash's entity extraction pulled automatically:

  • Crypto wallet addresses — Bitcoin (1/3/bc1), Monero, Ethereum addresses from listing pages
  • Onion addresses — .onion URLs for mirrors, vendor shops, or communication channels
  • Email addresses — clearnet or encrypted mail service contacts
  • PGP fingerprints — seller verification keys
  • Social handles — Telegram, Jabber, Wickr, Session IDs

These entities are indexed and searchable. When the same wallet address appears across multiple vendors or marketplaces, PageStash surfaces that connection.

Building a Research Database

Search Across Captures

With PageStash's full-text search, you can query across all archived marketplace pages:

  • Search for a vendor alias to find every page where they appear
  • Search for a wallet address to trace payment infrastructure across listings
  • Search for product keywords to track what's being sold and where
  • Search for price points to document pricing trends over time

Knowledge Graph Connections

PageStash's knowledge graph visualizes relationships between extracted entities across your captures. For marketplace research, this reveals:

  • Vendor networks — same PGP key or wallet across multiple aliases
  • Infrastructure overlap — shared .onion addresses between marketplace mirrors
  • Communication channels — common Telegram groups or Jabber servers referenced across listings
  • Financial flows — wallet addresses that connect seemingly separate vendor operations

Export for Analysis and Reporting

When it's time to produce deliverables, PageStash supports multiple export formats:

  • CSV — structured rows with capture metadata, timestamps, extracted entities, and tags. Import into Excel, Google Sheets, or a database for quantitative analysis
  • JSON — full data structure for programmatic processing, integration with OSINT platforms, or custom analysis scripts
  • Markdown — formatted reports with embedded metadata, suitable for briefings and documentation
  • HTML — self-contained visual archives that stakeholders can open in any browser

Maintaining Archival Discipline

Capture Cadence

For active marketplace monitoring, establish a regular capture schedule:

  • Daily for high-priority targets or fast-moving investigations
  • Weekly for broader landscape monitoring
  • Event-driven for new marketplace launches, exit scam indicators, or takedown aftermath

Version Tracking

Capture the same pages at regular intervals. PageStash's timestamped clips let you reconstruct a timeline of how listings, pricing, or vendor profiles changed over time. This chronological record is valuable for trend analysis and reporting.

Data Retention

Define retention policies aligned with your organization's guidelines. PageStash folders and export make it straightforward to archive completed cases and retain only active investigation data in your workspace.

Ethics and Legal Disclaimer

This guide is for lawful, passive research only.

  • Do not create accounts on illegal marketplaces or engage in transactions
  • Do not purchase illegal goods or services, even for "research purposes"
  • Passive observation of publicly accessible .onion content may be lawful in many jurisdictions, but consult legal counsel for your specific situation
  • Follow your organization's research policies, ethics board requirements, and applicable law
  • Do not retain content that is illegal to possess in your jurisdiction

Dark web marketplace research serves legitimate purposes in threat intelligence, academic study, policy analysis, and journalism. The line between observation and participation must be respected absolutely.

Build Your Marketplace Research Archive

PageStash gives analysts the structured archival workflow that dark web marketplace research demands. Timestamped captures, automatic entity extraction, case-based folder organization, and multi-format export—all running natively in Tor Browser.

Start archiving with PageStash and stop losing volatile marketplace data to broken bookmarks and unnamed screenshots.

TOPICS

Tor
OSINT
dark-web
Firefox
investigation
PageStash
marketplace
archival
web-archiving
evidence

Put These Tips Into Action

Start organizing your research with PageStash. Sign up for your free trial—10 clips/month included.