Tullio Facchinetti
Tag cloud
Picture wall
Daily
RSS Feed
  • RSS Feed
  • ATOM Feed
  • Daily Feed
Filters

Links per page

  • 20 links
  • 50 links
  • 100 links

Display

Filter untagged links
12 results tagged csv  ✕
Miller - CSV/TSV and other formats toolkit http://johnkerl.org/miller/doc/index.html
Mon 20 Jul 2020 06:00:23 PM CEST
QRCode
csv data_science file_management homepage opensource software tools
structured-text-tools - A list of command line tools for manipulating structured text data https://github.com/dbohdan/structured-text-tools
Sun 31 May 2020 09:49:26 PM CEST
QRCode
csv list markdown text-processing tools
CleverCSV - A Python package for handling messy CSV files https://github.com/alan-turing-institute/CleverCSV
Tue 14 Jan 2020 06:45:51 AM CET
QRCode

CleverCSV provides a drop-in replacement for the Python csv package with improved dialect detection for messy CSV files. It also provides a handy command line tool that can standardize a messy file or generate Python code to import it.

algorithm coding_lang:python csv data_mining library opensource software
xsv - Doing a SQL join with CSV files https://www.johndcook.com/blog/2019/12/31/sql-join-csv-files/
Wed 01 Jan 2020 03:55:27 PM CET
QRCode

How to combine data spread over two CSV files, like separate tables in a normalized relational database.

#cli-app article csv database programming terminal tutorial
csv2tex https://github.com/O2-AC/csv2tex/blob/master/README.md
Sat 09 Nov 2019 09:53:02 PM CET
QRCode

Converts csv files into LaTeX tables. Contribute to O2-AC/csv2tex development by creating an account on GitHub.

command_line conversion csv latex opensource software terminal
Data Package specifications https://frictionlessdata.io/specs/data-package/
Fri 06 Sep 2019 04:11:31 PM CEST
QRCode

A Data Package consists of:

  • Metadata that describes the structure and contents of the package
  • Resources such as data files that form the contents of the package
  • The Data Package metadata is stored in a "descriptor". This descriptor is what makes a collection of data a Data Package. The structure of this descriptor is the main content of the specification below.

In addition to this descriptor a data package will include other resources such as data files. The Data Package specification does NOT impose any requirements on their form or structure and can therefore be used for packaging any kind of data.

The data included in the package may be provided as:

  • Files bundled locally with the package descriptor
  • Remote resources, referenced by URL
  • "Inline" data (see below) which is included directly in the descriptor
csv data file_format homepage json
frictionlessdata | A Python library for working with Data Packages https://github.com/frictionlessdata/datapackage-py
Fri 06 Sep 2019 04:07:40 PM CEST
QRCode
coding_lang:python csv data dataset file_format json library machine_learning python source_code
TSV Utilities - Command line tools for large, tabular data files https://github.com/eBay/tsv-utils
Sun 01 Sep 2019 09:56:38 PM CEST
QRCode

This is a set of command line utilities for manipulating large tabular data files. Files of numeric and text data commonly found in machine learning, data mining, and similar environments. Filtering, sampling, statistics, joins, and more.

These tools are especially useful when working with large data sets. They run faster than other tools providing similar functionality, often by significant margins. See Performance Studies for comparisons with other tools.

They perform data manipulation and statistical calculations on tab delimited data. They are intended for large files. Larger than ideal for loading entirely in memory in an application like R, but not so big as to necessitate moving to Hadoop or similar distributed compute environments. The features supported are useful both for standalone analysis and for preparing data for use in R, Pandas, and similar toolkits.

From eBay.

#cli-app coding_lang:d command_line csv file_management filter opensource software
q - Run SQL-like queries on CSV/TSV files http://harelba.github.io/q/
Sun 25 Aug 2019 04:54:03 PM CEST
QRCode

Executes SQL-like queries on CSVs/TSVs tabular data files; each tabular file is treated as a database table; support to all SQL constructs (WHERE, GROUP BY, JOIN).

#cli-app category:text_processing csv filter opensource search software source_code SQL terminal text-processing tools
Turn Vim Into Excel: Tips for Editing Tabular Data http://alangrow.com/blog/turn-vim-into-excel-tips-for-tabular-data-editing
Mon 03 Dec 2018 04:07:04 AM CET
QRCode

The author tried to edit data in spreadsheet programs.

This post illustrate ho to use Vim to edit tabular data, although there are a few things that will make it more pleasant. It is assumed that editing files are in tab-separated value format (TSV).

"But what about CSV files?" Just. Don't.

Do: convert your CSV to TSV and back for editing.

image

article csv post text_manipulation tutorial vim
A data cleaner's cookbook - About https://www.polydesmida.info/cookbook/index.html
Thu 16 Aug 2018 03:50:26 PM CEST
QRCode

This is version 1 of a cookbook that will help you check whether a data table (defined on the data tables page) is properly structured and free from formatting errors, inconsistencies, duplicates and other data headaches.

article csv formatting guidelines tutorial
xsv - A fast CSV command line toolkit written in Rust https://github.com/BurntSushi/xsv
Thu 16 Aug 2018 03:17:17 PM CEST
QRCode

xsv is a command line program for indexing, slicing, analyzing, splitting and joining CSV files. Commands should be simple, fast and composable:

  1. Simple tasks should be easy.
  2. Performance trade offs should be exposed in the CLI interface.
  3. Composition should not come at the expense of performance.
#cli-app coding_lang:rust csv file_management opensource software
3672 links
Shaarli - The personal, minimalist, super-fast, database free, bookmarking service by the Shaarli community - Theme by kalvn