Extract insights from any website

sget is a modern command-line tool for web data extraction. Like curl, but interactive, visual, and built for humans.

sget

user@archlinux:~$ sget --info https://archlinux.org

=== Inspected Data ===

[ ]

URL: https://archlinux.org

Status Code: 200

Server: nginx

Content-Type: text/html; charset=utf-8

Page Size: 25030 bytes

[ METADATA ]

Title: Arch Linux

Description: Not found

Logo/Favicon: /static/archlinux_common_style/favicon.png

sget is a frontend built on libcurl.

Built for users who want performance, simplicity, and only what they need; without complex notations.

Visuals

See the output in colorfull way and output which matters.

Structured Output

Get clean JSON output; with all the necessary infos required

Simpilicity

Multiple flags which help you get what you "really" want

Built-in File Support

Extracted data can be easily piped and stored into a file with just one flag.

Simple syntax, powerful results.

Extract exactly what you need. No need to memorize cryptic flags.

  • CSS and XPath selectors
  • JSON, CSV, and table output
  • Follow redirects & handle auth
  • Built-in caching & rate limiting

// Fetch the metadata of a webpage

sget --info https://archlinux.org

// Extract only the page title

sget --links https://archlinux.org

// Write output to a file

sget --json https://archlinux.org -o icon.png

// Output the raw JSON content directly

sget --json https://archlinux.org

Ready to fetch?

Get started in seconds. Works on macOS, Linux, and Windows.