Website Information Gathering Tool

Terminal Guide & Documentation

Website Information Gathering Tool

Introduction

WebExtractor is a specialized OSINT (Open Source Intelligence) tool designed for extracting valuable data from websites directly via the Termux terminal. It allows users to scrape emails, phone numbers, and all available links from a target URL, making it an essential utility for reconnaissance and web analysis.

Prerequisites

Before installing the tool, ensure your Termux environment is ready with the following dependencies:

  • Git: For cloning the tool repository.
  • Python: To execute the tool scripts.
  • Pip: To install the required Python libraries.

Installation Steps

Follow these commands sequentially to set up WebExtractor in Termux:

1. Update & Upgrade Packages

pkg update && pkg upgrade -y

2. Install Git & Python

pkg install git python -y

3. Clone the Repository

git clone https://github.com/s-r-e-e-r-a-j/WebExtractor

4. Navigate to the Directory

cd WebExtractor

5. Install Requirements

pip install -r requirements.txt

Usage & Scanning

Once installed, you can start gathering information by running the tool and following the interactive prompts.

Launch the Tool

python webextractor.py

Interactive Options

After launching, the tool will ask for the following inputs:

  • Target URL: Enter the website URL (e.g., https://www.example.com).
  • Scrape Emails: Enter y to extract email addresses.
  • Scrape Phone Numbers: Enter y to extract contact numbers.
  • Scrape Links: Enter y to collect all internal and external links.

Results will be displayed directly in the terminal window for immediate review.

Explore More Tools

Nmap

Powerful network discovery and security auditing.

WAFW00F

Identify Web Application Firewalls (WAF).

Cool Commands

Visual and fun terminal commands for Termux.