Writing sparse arrays in Python with Unicode containing NaN

ihnorton · November 24, 2020, 8:44pm

A couple comments – happy to dig in further if I misunderstood what you mean by “from scratch” here:

You only need to specify/fill the columns with NAs.
If you are using Pandas read_csv implementation directly, there are a variety of options for handling NAs, including overrides for interpretation of other strings as NA (see here).
For Pandas dataframes, you can call pandas.DataFrame.fillna directly to accomplish the same thing directly on the dataframe.

Indeed! Another highlight is the “Hilbert” tiling feature, which will simplify array creation as well as providing significant performance boosts. There is an active list of upcoming features and improvements here:

github.com

TileDB-Inc/TileDB/blob/dev/HISTORY.md#new-features

# TileDB v2.18.0 Release Notes

## Announcements

* TileDB 2.18, targeted for release in November 2023, includes a preview set of aggregate pushdown APIs. The APIs will be finalized in 2.19 with performance improvements.

## Disk Format

* Fix the format specification for group members. [#4380](https://github.com/TileDB-Inc/TileDB/pull/4380)
* Update fragment format spec for info on tile sizes and tile offsets. [#4416](https://github.com/TileDB-Inc/TileDB/pull/4416)

## Configuration changes

* Remove vfs.file.max_parallel_ops config option. [#3964](https://github.com/TileDB-Inc/TileDB/pull/3964)

## Breaking C API changes

* Behavior breaking change: `tiledb_group_remove_member` cannot remove named members by URI if the URI is duplicated. [#4391](https://github.com/TileDB-Inc/TileDB/pull/4391)

## New features

This file has been truncated. show original

Best,
Isaiah

Topic		Replies	Views
Storing waveform segment as var attr in sparse array	2	732	February 5, 2021
Am I wrongly filling a sparse array that has a variable length string attribute? Or is this a bug?	4	919	August 15, 2023
Weird behavior with variable length attributes	2	574	October 19, 2022
Using a multi-dimensional sparse array in python	4	1233	July 31, 2019
Pandas dataframe examples?	4	2209	October 21, 2020

Writing sparse arrays in Python with Unicode containing NaN

Related topics