127 private links
CLI tool for exploring arXiv (inspired by karpathy's brilliant ArXiv Sanity Preserver)
The script will create data/pdf/, data/txt/ and data/summary/ directories to hold files downloaded from arXiv. I am also aware that this is a rather stupid way to implement a datastore but DBs seem a bit over the top. Text from PDFs are auto-converted on downloaded and are used to suggest future articles to the user. Downloading articles is idempotent.
A command line program for getting Wikipedia summaries easily.
From the creators of ack, a list of tools that are related with ack
. In particular they list:
- Tools that work with
ack
- Other grep-like tools
- Indexing tools
rga
is a line-oriented search tool that allows you to look for a regex in a multitude of file types.
rga
wraps the awesome ripgrep and enables it to search in pdf, docx, sqlite, jpg, movie subtitles (mkv, mp4), etc.
rga (or ripgrep-all) is a command line tool to recursively search all files in a directory for a regex pattern, that runs on Linux, macOS and Windows. It's a wrapper for ripgrep, the line-oriented recursive search program, on top of which it enables search in a multitude of file types like PDF, DOCX, ODT, EPUB, SQLite databases, movies subtitles embedded in MKV or MP4 files, archives like ZIP or GZ, and more.
Fast and powerful open source alternative to grep; it targets flexibility and performance: can be as fast as "regular" grep and allows to specify complex expressions to find text.
Greps regular expressions in a text file(s) and prints out the paragraphs containing those expressions; a paragraph is defined as a block of text delimited by an empty or blank line; fully customizable via command line parameters.
(The silver searcher) is a text search utility targeted to source code; it skips versioning systems data directories; it is inspired by ack
, but faster.
A tool like grep
optimized for programmers; written in Perl, it speeds up searches thanks to skipping non interesting directories, such as .git
.
Executes SQL-like queries on CSVs/TSVs tabular data files; each tabular file is treated as a database table; support to all SQL constructs (WHERE
, GROUP BY
, JOIN
).
Utility that allows users to choose one option from a set of choices using an interface with fuzzy search functionality.
A Python script that
1) receives input lines from stdin
or a file,
2) lists the input lines and waits for input that filter/select the line(s),
3) outputs the selected line(s) to stdout
;
Can be used to add interactivity to many regular shell commands.
(JSON Query?) is sed-like processor for JSON data; can be used to process JSON files and data streams and perform operations such as those allowed by cat
, sed
, grep
and awk
on regular text files.
(Generic Colouriser) can be configured to parse a given text stream and to colorize it according to regexp written in configuration files; different patterns can be associated to file types.
ngrep
applies the grep
logic to the network layer, allowing to match regular expressions against data payloads of packets; it recognizes IPv4/6, TCP, UDP, ICMPv4/6, IGMP and Raw across Ethernet, PPP, SLIP, FDDI, Token Ring and null interfaces.
A simple, fast and user-friendly alternative to find. Written in Rust.