Website Information Gathering Tool
Introduction
WebExtractor is a specialized OSINT (Open Source Intelligence) tool designed for extracting valuable data from websites directly via the Termux terminal. It allows users to scrape emails, phone numbers, and all available links from a target URL, making it an essential utility for reconnaissance and web analysis.
Prerequisites
Before installing the tool, ensure your Termux environment is ready with the following dependencies:
- Git: For cloning the tool repository.
- Python: To execute the tool scripts.
- Pip: To install the required Python libraries.
Installation Steps
Follow these commands sequentially to set up WebExtractor in Termux:
1. Update & Upgrade Packages
pkg update && pkg upgrade -y
2. Install Git & Python
pkg install git python -y
3. Clone the Repository
git clone https://github.com/s-r-e-e-r-a-j/WebExtractor
4. Navigate to the Directory
cd WebExtractor
5. Install Requirements
pip install -r requirements.txt
Usage & Scanning
Once installed, you can start gathering information by running the tool and following the interactive prompts.
Launch the Tool
python webextractor.py
Interactive Options
After launching, the tool will ask for the following inputs:
- Target URL: Enter the website URL (e.g.,
https://www.example.com). - Scrape Emails: Enter
yto extract email addresses. - Scrape Phone Numbers: Enter
yto extract contact numbers. - Scrape Links: Enter
yto collect all internal and external links.
Results will be displayed directly in the terminal window for immediate review.