feat: Add new file type for data files #1735

@namurphy

Suggestion

Add a new file type for commonly used data files in src/info/filetype.rs.

Description

There are a number of widely used file extensions for data files. These include text-based data (e.g., .csv for comma-separated values), binary data (e.g., .hdf5 or .nc for scientific applications), and databases (e.g., .sqlite or .sqlite3). Because many of us work with data files regularly, it would be quite helpful to have a straightforward way to visually distinguish them from other file types.

My personal motivation is that while customizing theme.yml, I found myself adding a bunch of different data file types that come up for science and research software engineering, and it was quite tedious having to do this for a wide variety of data file types.

File extensions

Here is a subset of the file extensions that could be added, though it's missing quite a few extensions (especially for databases).

| Extension | Description |
| --- | --- |
| .asdf | Advanced Scientific Data Format |
| .bin | generic binary files |
| .cdf | CDF |
| .csv | comma-separated values |
| .dat | generic data files |
| .db | generic database |
| .geojson | GeoJSON |
| .fit, .fits, .fts | FITS |
| .h5, .hdf, .hdf5 | HDF5 |
| .ini | INI |
| .json | JSON |
| .nc | NetCDF |
| .npy | NumPy array file |
| .npz | NumPy compressed archive |
| .parquet | Parquet |
| .pkl | Python pickle |
| .shp, .shx, .dbf, .prj | ESRI Shapefile |
| .sqlite, .sqlite3 | SQLite |
| .tsv | tab-separated values |
| .toml | TOML |
| .yaml, .yml | YAML |
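
To make the suggestion concrete, here is a minimal, standalone sketch of how the extensions listed above might map to a single data category. It does not reflect the actual contents of src/info/filetype.rs: the `FileType` enum, its `Data` and `Other` variants, the `DATA_EXTENSIONS` list, and the `classify` function are all hypothetical names chosen for illustration.

```rust
use std::path::Path;

/// Hypothetical category enum for illustration only; `Data` is the
/// category proposed in this issue, `Other` stands in for everything else.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum FileType {
    Data,
    Other,
}

/// Extensions from the list above, lowercase and without the leading dot.
const DATA_EXTENSIONS: &[&str] = &[
    "asdf", "bin", "cdf", "csv", "dat", "db", "geojson", "fit", "fits", "fts",
    "h5", "hdf", "hdf5", "ini", "json", "nc", "npy", "npz", "parquet", "pkl",
    "shp", "shx", "dbf", "prj", "sqlite", "sqlite3", "tsv", "toml", "yaml", "yml",
];

/// Classify a path by its final extension, ignoring ASCII case.
/// Paths without an extension fall through to `FileType::Other`.
pub fn classify(path: &Path) -> FileType {
    let is_data = path
        .extension()
        .and_then(|ext| ext.to_str())
        .map(|ext| {
            DATA_EXTENSIONS
                .iter()
                .any(|known| known.eq_ignore_ascii_case(ext))
        })
        .unwrap_or(false);

    if is_data {
        FileType::Data
    } else {
        FileType::Other
    }
}
```

With this sketch, `classify(Path::new("results.csv"))` returns `FileType::Data`, while a source file such as `main.rs` falls through to `FileType::Other`. A real implementation would presumably plug into whatever lookup structure the project already uses rather than a linear scan.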

Thank you!
