New to tiledb. Got a tiledb cloud account with $10 credits. Executing the Notebook TileDB-VCF tutorial. Import tiledbvcf is erroring out. I shouldn’t even need this locally on my mac because I was trying to run it within the TileDB cloud. Anyways went ahead and installed conda and tiledbvcf successfully on my mac using the commands below:
Hello @Sal ! When running in TileDB Cloud notebooks we have three image options currently when launching a notebook, Basic Data Science, Genomics and Geospatial. Do you know which notebook image you were in when you were trying to run the tutorial that errored? You need to be in the Genomics image for tiledbvcf to be available.
TileDB Cloud also supports automatically selecting the notebook image for you for most tutorials. This works if you do not already have a notebook server running. Would you mind linking me to the tutorial you were trying to run and I’ll double check that it defaults to Genomics in the “Launch” button on the notebook details page.
For the module error on your macbook. Can you let me know, is the script you were running in the same conda environment as the one line python -c "import tiledbvcf; print(tiledbvcf.version)" you ran? Can you also let me know how you install tiledb-cloud was that with pip? I’m looking to try to see what might be different between when you ran the python command and then the script.
[quote=“seth, post:2, topic:661, full:true”]
Hello @Sal ! When running in TileDB Cloud notebooks we have three image options currently when launching a notebook, Basic Data Science, Genomics and Geospatial. Do you know which notebook image you were in when you were trying to run the tutorial that errored? You need to be in the Genomics image for tiledbvcf to be available.
I logged into the cloud console. It automatically selected a ‘Basic data science’ image and I opened up the tutorial_tiledbvcf_basics.ipynb. How do I switch to the ‘Genomics’ image ?
TileDB Cloud also supports automatically selecting the notebook image for you for most tutorials. This works if you do not already have a notebook server running. Would you mind linking me to the tutorial you were trying to run and I’ll double check that it defaults to Genomics in the “Launch” button on the notebook details page.
For the module error on your macbook. Can you let me know, is the script you were running in the same conda environment as the one line python -c "import tiledbvcf; print(tiledbvcf.version)" you ran? Can you also let me know how you install tiledb-cloud was that with pip? I’m looking to try to see what might be different between when you ran the python command and then the script.
[/quote]
I did not install tiledb-cloud. I ran the python command from my terminal window on my mac. The TleBD-VCF tutorial Notebook I ran from the TileDB Cloud console.
Thanks for the additional details @Sal . In TileDB Cloud the easiest way to switch notebook images is to click the “Shut down” button on the top right corner. After your notebook is shutdown, you can select “Compute” on the left hand side menu. From the compute screen you can then then select launch notebook and select the image type.
@Sal thanks for bringing this additional error up. The root cause is that the bucket where the example VCF files is stored is located in us-east-1. The notebook you launched was in us-west-2. A small adjustment to the tutorial was needed to ensure setting the region. I’ve updated the notebook, the change was in cell #7, adding a config of the region:
# We set the region to us-east-1 so we can load the example vcf files
ds = tiledbvcf.Dataset(uri=array_uri, mode="w", cfg=tiledbvcf.ReadConfig(tiledb_config={"vfs.s3.region": "us-east-1"}))
ds```