NetSense
April 21, 2026
NetSense is a professional-grade, recursive web scraper designed to build high-quality text datasets for training AI models. It crawls domains, extracts clean text content, and organizes it into structured .txt files while maintaining a minimalist, high-fidelity terminal interface.
Github Repository
Core Features
- Recursive Scraping: Crawls internal links automatically
- AI-Ready Text: Strips code and styles for pure text
- Live Activity Log: High-fidelity real-time scrolling log
- Control Suite: Pause, Resume, and Stop with state preservation
- Native Aesthetic: Zero-background rendering
- Dataset Organization: Saves resources with metadata
Installation
Mac & Linux
curl -sSL https://raw.githubusercontent.com/abubakerx1da49/netsense/main/install.sh | bashWindows
git clone https://github.com/abubakerx1da49/netsense.git && cd netsense && python -m venv env && .\env\Scripts\activate && pip install -r requirements.txt