This guide walks you through making your first multimodal genomic prediction with AlphaGenomeR. By the end, you will have queried the AlphaGenome API for a 1 MB region on chromosome 17, extracted RNA-seq and ATAC-seq predictions for lung tissue, and inspected the resulting matrix and metadata data frame. Before you begin, complete the installation steps to install the R package, theDocumentation Index
Fetch the complete documentation index at: https://mintlify.com/BDB-Genomics/AlphaGenomeR/llms.txt
Use this file to discover all available pages before exploring further.
alphagenome Python dependency, and obtain your API key.
Run your first prediction
Load the library
Start a fresh R session and load AlphaGenomeR. This also loads
reticulate and initializes the Python bridge on first use.Set your API key
Store your AlphaGenome API key in a variable. The recommended approach is to read it from an environment variable so it never appears in your scripts:If you have not set the environment variable yet, you can assign the key directly for testing — but avoid committing this to version control:
Define a genomic region
AlphaGenome requires a 1 MB genomic interval in The interval must be approximately 1 MB. The model enforces this constraint and will return an error for intervals of other sizes.
chr:start-end format using hg38 coordinates. This example uses a region on chromosome 17:Query the API
Call
alphagenome_query() with your API key, genomic region, tissue ontology term, and the modalities you want. This example requests RNA-seq and ATAC-seq predictions for lung tissue (UBERON:0002048):alphagenome_query() sends a gRPC request via the AlphaGenome Python SDK and returns the full multimodal prediction as an R list. The call may take several seconds depending on the number of requested modalities.Extract predictions
Use the modality-specific extractor functions to pull individual outputs from the results list:Each extractor returns a named list with two elements:
$values— a numeric matrix of predictions (positions × tracks)$metadata— a data frame describing each track (cell type, tissue, experiment)
Inspect the results
Explore the prediction matrix and track metadata:The
values matrix has one row per base position in the queried interval and one column per predicted track. The metadata data frame has one row per track, with columns identifying the tissue, cell type, and experimental conditions.What’s next
Guides
Learn how to manage API keys, work with genomic regions, query tissue-specific predictions, and integrate results with Bioconductor packages.
Modalities
Explore all 11 supported genomic modalities and the extractor function for each one.