Extract insights from any website
sget is a modern command-line tool for web data extraction.
Like curl, but interactive, visual, and built for humans.
user@archlinux:~$ sget --info https://archlinux.org
=== Inspected Data ===
[ ]
URL: https://archlinux.org
Status Code: 200
Server: nginx
Content-Type: text/html; charset=utf-8
Page Size: 25030 bytes
[ METADATA ]
Title: Arch Linux
Description: Not found
Logo/Favicon: /static/archlinux_common_style/favicon.png
sget is a frontend built on libcurl.
Built for users who want performance, simplicity, and only what they need; without complex notations.
Visuals
See the output in colorfull way and output which matters.
Structured Output
Get clean JSON output; with all the necessary infos required
Simpilicity
Multiple flags which help you get what you "really" want
Built-in File Support
Extracted data can be easily piped and stored into a file with just one flag.
Simple syntax,
powerful results.
Extract exactly what you need. No need to memorize cryptic flags.
- CSS and XPath selectors
- JSON, CSV, and table output
- Follow redirects & handle auth
- Built-in caching & rate limiting
// Fetch the metadata of a webpage
❯sget --info https://archlinux.org
// Extract only the page title
❯sget --links https://archlinux.org
// Write output to a file
❯sget --json https://archlinux.org -o icon.png
// Output the raw JSON content directly
❯sget --json https://archlinux.org
Ready to fetch?
Get started in seconds. Works on macOS, Linux, and Windows.