Replicating analysis on Xenium lung cancer dataset

Hi,

I am trying to replicate the results from the [Proseg paper](https://www.nature.com/articles/s41592-025-02697-0#Sec9), specifically with the [Xenium lung cancer dataset](https://www.10xgenomics.com/datasets/preview-data-ffpe-human-lung-cancer-with-xenium-multimodal-cell-segmentation-1-standard). I am not able to replicate the plots in Figure 5, and would like to ask what parameters you used to run Proseg, and also in the preprocessing and subsequent analysis.

I used Proseg v3.1.0 with the following command:
```
proseg --output-spatialdata output_replication.zarr --output-expected-counts replication_expected_counts.mtx.gz --xenium ../transcripts.parquet
```

For the analysis, I used the following code to filter out cells with less than 10 transcripts and take the censored log proportions, as per the paper.
```
sc.pp.filter_cells(adata, min_counts=10)

cell_sums = adata.X.sum(axis=1)
temp_X = adata.X.copy()
temp_X = temp_X.toarray().astype(np.float64)
temp_X = temp_X / cell_sums
temp_X = np.clip(temp_X, a_min = 1e-4, a_max = None)
temp_X = np.log(temp_X)

adata.X = temp_X.copy()
```

Then I used the following code to perform dimensionality reduction and clustering.
```
sc.pp.neighbors(adata, n_neighbors = 15, use_rep = 'X')
sc.tl.umap(adata)
sc.tl.leiden(adata, resolution= 1)
sc.pl.umap(adata, color='leiden')
```

The UMAPs I got did not resemble those of Figure 5A - I have attached here the plots. If you could let me know how you analysed the Xenium lung cancer sample, I would greatly appreciate it. Thank you very much! 

<img width="581" height="429" alt="Image" src="https://github.com/user-attachments/assets/48da2977-6634-4cf5-9b6f-20580fa56af3" />

<img width="581" height="429" alt="Image" src="https://github.com/user-attachments/assets/cdf6b358-d204-478c-af97-02d1d820f326" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replicating analysis on Xenium lung cancer dataset #142

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Replicating analysis on Xenium lung cancer dataset #142

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions