Implement Feature/finding marker genes to develop #32

happymealinthebuilding · 2025-05-23T20:22:46Z

This Pull Request integrates the finding marker genes branch into develop. This integration provides a foundational set of tools and a demonstrated workflow for identifying and working with marker genes. The utility functions offer flexibility for analyzing clustering results and visualizing gene expression patterns, which will be crucial for ongoing and future research requiring cell type characterization and biomarker discovery.

…ntation

…NA-seq data

…AnnData

…ell RNA-seq data

Implement marker gene expression visualization Introduces the `visualize_marker_genes` function for visualizing marker gene expression across cell clusters in an AnnData object. This function: - Accepts an `adata` object, a list/dict of `marker_genes`, and a `cluster_key` (defaulting to 'leiden') from `adata.obs` for grouping. - Generates a dot plot using `sc.pl.dotplot` to show average expression and the percentage of cells expressing each marker gene per cluster. - Additionally, creates a stacked violin plot using `sc.pl.stacked_violin` to display the distribution of marker gene expression within each cluster. - Facilitates comprehensive visual identification and validation of potential marker genes.

…ies_chosen_algorithm_new_cluster_names

…lgorithm_new_cluster_names Standardize cluster categories and names Implements the `rename_clusters` function to update cluster labels within an AnnData object. This function: - Takes an `adata` object, a `cluster_algo` key (e.g., 'leiden', 'louvain' from `adata.obs`), and a list of `new_cluster_names`. - Validates that the `cluster_algo` key exists in `adata.obs` and that the number of `new_cluster_names` matches the existing number of clusters. - Utilizes `adata.rename_categories()` to perform the renaming in-place. - Ensures consistent and interpretable cluster labeling for downstream marker gene analysis.

Extract top N ranked marker genes and p-values Implements the `extract_top_genes` function to retrieve top-ranked genes and their associated p-values from pre-computed `rank_genes_groups` results stored in an AnnData object. This function: - Takes an `adata` object and an optional `n_top` parameter (defaulting to 5) to specify the number of top genes per group. - Validates the presence of `adata.uns['rank_genes_groups']` before processing. - Parses the `names` and `pvals` fields from the `rank_genes_groups` results. - Returns a tuple containing two pandas DataFrames: 1. A DataFrame of top gene names per cluster/group. 2. A combined DataFrame with top gene names and their corresponding p-values for each cluster/group. - Provides a structured and quantitative list of candidate marker genes.

happymealinthebuilding and others added 19 commits April 11, 2025 20:07

Add get_top_ranked_genes

412c9b4

Add rename_clusters function to rename cluster labels in adata.obs

77bd5ef

Move rename_clusters function to modules directory and update impleme…

d21df2e

…ntation

Update .gitignore to include data directory

9e65fce

Add dotplot visualization for marker gene expression in single-cell R…

af76b89

…NA-seq data

Add tutorial comment to demo_get_top_ranked_genes.py

bcbe0e0

Remove tutorial comment from demo_get_top_ranked_genes.py

ce53b0b

Add Tutorial File

6c83a44

Remove tutorial file from the project

66685d0

Add files via upload

fb150a9

Added marked_genes_demo.py file

655ed2d

Update marked_genes.py

e227aa5

Add rename_cluster_demo.py to demonstrate renaming cluster labels in …

96cc703

…AnnData

Add get_top_ranked_genes_demo function

f919f47

Add dotplot visualization demo for marker gene expression in single-c…

a5d9159

…ell RNA-seq data

Merge branch 'feature/finding_marker_genes' into adata_rename_categor…

f3eff81

…ies_chosen_algorithm_new_cluster_names

happymealinthebuilding requested a review from TRextabat May 23, 2025 20:22

happymealinthebuilding assigned TRextabat May 23, 2025

happymealinthebuilding added this to Single Cell Web May 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Feature/finding marker genes to develop #32

Implement Feature/finding marker genes to develop #32

Uh oh!

happymealinthebuilding commented May 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Implement Feature/finding marker genes to develop #32

Are you sure you want to change the base?

Implement Feature/finding marker genes to develop #32

Uh oh!

Conversation

happymealinthebuilding commented May 23, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants