diff --git a/docs/source/Combination_Kidney_IgAN.md b/docs/source/Combination_Kidney_IgAN.md
index 8f32d94..5752119 100644
--- a/docs/source/Combination_Kidney_IgAN.md
+++ b/docs/source/Combination_Kidney_IgAN.md
@@ -1,9 +1,10 @@
-### Trajectory Analysis of Kidney IgAN Data with PILOT
+### Trajectory Analysis and Integration of Modalities using Kidney IgAN (Pathomics) Data with PILOT
 
 <div class="alert alert-block alert-info">
 <b>PILOT</b>
 
 Welcome to the PILOT Package Tutorial for pathomics Data!
+With this tutorial, we learn not only how to analyze the Pathomics Data but also how to integrate multimodal data with PILOT. 
  
 You can find the pathomics data [here](https://github.com/CostaLab/PILOT/tree/main/Tutorial/Datasets).
 
@@ -15,7 +16,7 @@ import pilotpy as pl
 import scanpy as sc
 ```
 
-#### Kidney_IgAN Tubuli
+#### Kidney_IgAN Tubuli (first modality)
 
 ##### Reading Anndata
 <div class="alert alert-block alert-info">
@@ -93,7 +94,7 @@ pl.pl.trajectory(adata_T, colors = ['red','blue','orange'])
     
 
 
-#### Kidney_IgAN Glomeruli
+#### Kidney_IgAN Glomeruli (second modality)
 
 ##### Reading Anndata
 <div class="alert alert-block alert-info">
@@ -144,9 +145,9 @@ pl.pl.trajectory(adata_G, colors = ['red','blue','orange'])
     
 
 
-#### Combination:
+####  Integration of modalities:
 <div class="alert alert-block alert-info"> 
-Here, we combine the distances of samples. We get the sum of distances of samples based on Tubuli and Glomeruli distances.   
+Here, we integrate the distances from the first (Tubuli) and second (Glomeruli) modalities. We get the sum of the distances of samples.
 </div>
 
 
diff --git a/docs/source/Myocardial_infarction.md b/docs/source/Myocardial_infarction.md
index 2082c88..4197429 100644
--- a/docs/source/Myocardial_infarction.md
+++ b/docs/source/Myocardial_infarction.md
@@ -152,505 +152,6 @@ for cell in adata.uns['cellnames']:
     plot_genes = False)
 ```
 
-##### Cluster Specific Marker Changes:
-<div class="alert alert-block alert-info"> 
-The previous test only finds genes with significant changes over time for a given cell type. However, it does not consider if a similar pattern and expression values are found in other clusters. To further select genes, we use a Wald test that compares the fit of the gene in the cluster vs. the fit of the gene in other clusters.
-In the code below, we consider top genes (regarding the regression fit) for two interesting cell types discussed in the manuscript (‘healthy CM’ and ‘Myofib’).
-</div>
-
-
-```python
-pl.tl.gene_cluster_differentiation(adata,cellnames = ['healthy_CM','Myofib'], number_genes = 70)
-```
-
-    
-
-
-
-<div class="alert alert-block alert-info"> 
-Test results are saved in ‘gene_clusters_stats_extend.csv’. To find a final list of genes, we only consider genes with a fold change higher than 0.5, i.e. genes which expression is increased in the cluster at hand; and we sort the genes based on the Wald test p-value. These can be seen bellow.
-</div>
-
-
-```python
-df = pl.tl.results_gene_cluster_differentiation(cluster_name = 'Myofib',).head(50)
-df.head(15)
-```
-
-
-
-
-<div>
-<style scoped>
-    .dataframe tbody tr th:only-of-type {
-        vertical-align: middle;
-    }
-
-    .dataframe tbody tr th {
-        vertical-align: top;
-    }
-
-    .dataframe thead th {
-        text-align: right;
-    }
-</style>
-<table border="1" class="dataframe">
-  <thead>
-    <tr style="text-align: right;">
-      <th></th>
-      <th>gene</th>
-      <th>cluster</th>
-      <th>waldStat</th>
-      <th>pvalue</th>
-      <th>FC</th>
-      <th>Expression pattern</th>
-      <th>fit-pvalue</th>
-      <th>fit-mod-rsquared</th>
-    </tr>
-  </thead>
-  <tbody>
-    <tr>
-      <th>2642</th>
-      <td>GAS7</td>
-      <td>Myofib</td>
-      <td>212.477292</td>
-      <td>8.487275e-46</td>
-      <td>1.086644</td>
-      <td>linear up quadratic down</td>
-      <td>1.873033e-107</td>
-      <td>0.570704</td>
-    </tr>
-    <tr>
-      <th>2151</th>
-      <td>EXT1</td>
-      <td>Myofib</td>
-      <td>125.383128</td>
-      <td>5.344198e-27</td>
-      <td>0.786136</td>
-      <td>linear up quadratic down</td>
-      <td>3.159831e-35</td>
-      <td>0.555757</td>
-    </tr>
-    <tr>
-      <th>4979</th>
-      <td>PKNOX2</td>
-      <td>Myofib</td>
-      <td>89.738712</td>
-      <td>2.492742e-19</td>
-      <td>0.855504</td>
-      <td>quadratic down</td>
-      <td>1.039404e-117</td>
-      <td>0.544122</td>
-    </tr>
-    <tr>
-      <th>2529</th>
-      <td>FN1</td>
-      <td>Myofib</td>
-      <td>70.641696</td>
-      <td>3.110595e-15</td>
-      <td>1.573680</td>
-      <td>linear down quadratic up</td>
-      <td>2.947389e-188</td>
-      <td>0.633774</td>
-    </tr>
-    <tr>
-      <th>1437</th>
-      <td>COL6A3</td>
-      <td>Myofib</td>
-      <td>54.751169</td>
-      <td>7.758841e-12</td>
-      <td>1.069156</td>
-      <td>linear down quadratic up</td>
-      <td>3.514298e-172</td>
-      <td>0.608543</td>
-    </tr>
-    <tr>
-      <th>5775</th>
-      <td>RORA</td>
-      <td>Myofib</td>
-      <td>52.486295</td>
-      <td>2.359167e-11</td>
-      <td>0.899459</td>
-      <td>quadratic down</td>
-      <td>7.232834e-174</td>
-      <td>0.587234</td>
-    </tr>
-    <tr>
-      <th>2832</th>
-      <td>GXYLT2</td>
-      <td>Myofib</td>
-      <td>24.247113</td>
-      <td>2.218154e-05</td>
-      <td>2.000205</td>
-      <td>linear up quadratic down</td>
-      <td>2.402171e-85</td>
-      <td>0.537920</td>
-    </tr>
-    <tr>
-      <th>3783</th>
-      <td>MGP</td>
-      <td>Myofib</td>
-      <td>23.244418</td>
-      <td>3.591226e-05</td>
-      <td>0.871041</td>
-      <td>quadratic down</td>
-      <td>1.327779e-225</td>
-      <td>0.571374</td>
-    </tr>
-    <tr>
-      <th>4726</th>
-      <td>PCDH9</td>
-      <td>Myofib</td>
-      <td>20.439646</td>
-      <td>1.376052e-04</td>
-      <td>0.604830</td>
-      <td>linear down</td>
-      <td>0.000000e+00</td>
-      <td>0.596035</td>
-    </tr>
-    <tr>
-      <th>1231</th>
-      <td>CHD9</td>
-      <td>Myofib</td>
-      <td>20.389564</td>
-      <td>1.409364e-04</td>
-      <td>0.527488</td>
-      <td>linear up quadratic down</td>
-      <td>7.658862e-77</td>
-      <td>0.559604</td>
-    </tr>
-    <tr>
-      <th>1710</th>
-      <td>DCN</td>
-      <td>Myofib</td>
-      <td>19.656307</td>
-      <td>1.999818e-04</td>
-      <td>1.033697</td>
-      <td>linear up quadratic down</td>
-      <td>1.866152e-284</td>
-      <td>0.588602</td>
-    </tr>
-    <tr>
-      <th>2824</th>
-      <td>GSN</td>
-      <td>Myofib</td>
-      <td>18.015612</td>
-      <td>4.366007e-04</td>
-      <td>0.638136</td>
-      <td>linear up quadratic down</td>
-      <td>2.942472e-279</td>
-      <td>0.601684</td>
-    </tr>
-    <tr>
-      <th>1392</th>
-      <td>COL3A1</td>
-      <td>Myofib</td>
-      <td>17.276479</td>
-      <td>6.199787e-04</td>
-      <td>1.240454</td>
-      <td>linear down quadratic up</td>
-      <td>0.000000e+00</td>
-      <td>0.665616</td>
-    </tr>
-    <tr>
-      <th>1372</th>
-      <td>COL1A2</td>
-      <td>Myofib</td>
-      <td>14.068816</td>
-      <td>2.812963e-03</td>
-      <td>1.327753</td>
-      <td>linear down quadratic up</td>
-      <td>0.000000e+00</td>
-      <td>0.655032</td>
-    </tr>
-    <tr>
-      <th>7245</th>
-      <td>VCAN</td>
-      <td>Myofib</td>
-      <td>12.610158</td>
-      <td>5.560192e-03</td>
-      <td>0.838764</td>
-      <td>linear down quadratic up</td>
-      <td>1.761922e-164</td>
-      <td>0.571981</td>
-    </tr>
-  </tbody>
-</table>
-</div>
-
-
-
-<div class="alert alert-block alert-info"> 
-Here is the GO enrichment for  the 50 first top genes of Myofib (FC >= 0.5 and p-value < 0.01). Plot is saved at Go folder.
-</div>
-
-
-```python
-pl.pl.go_enrichment(df, cell_type = 'Myofib')
-```
-
-
-    
-![png](Myocardial_infarction_files/Myocardial_infarction_23_0.png)
-    
-
-
-<div class="alert alert-block alert-info"> 
-We can visualize specific genes, for example the ones discussed in PILOT manuscript (COL1A2, DCN and EXT1). In the plot, the orange line indicates the fit in the target cell type (shown as orange lines) compared to other cell types (represented by grey lines). Plots of genes are saved at 'plot_genes_for_Myofib' folder.
-</div>
-
-
-```python
-pl.pl.exploring_specific_genes(cluster_name = 'Myofib', gene_list = ['COL1A2','DCN','EXT1'])
-```
-
-
-
-
-
-    
-![png](Myocardial_infarction_files/Myocardial_infarction_25_1.png)
-    
-
-
-<div class="alert alert-block alert-info"> 
-We can repeat the same analysis for healthy_CM cell type by using the following commands.
-</div>
-
-
-```python
-df=pl.tl.results_gene_cluster_differentiation(cluster_name = 'healthy_CM').head(50)
-df.head(15)
-```
-
-
-
-
-<div>
-<style scoped>
-    .dataframe tbody tr th:only-of-type {
-        vertical-align: middle;
-    }
-
-    .dataframe tbody tr th {
-        vertical-align: top;
-    }
-
-    .dataframe thead th {
-        text-align: right;
-    }
-</style>
-<table border="1" class="dataframe">
-  <thead>
-    <tr style="text-align: right;">
-      <th></th>
-      <th>gene</th>
-      <th>cluster</th>
-      <th>waldStat</th>
-      <th>pvalue</th>
-      <th>FC</th>
-      <th>Expression pattern</th>
-      <th>fit-pvalue</th>
-      <th>fit-mod-rsquared</th>
-    </tr>
-  </thead>
-  <tbody>
-    <tr>
-      <th>6165</th>
-      <td>SORBS1</td>
-      <td>healthy_CM</td>
-      <td>1574.665604</td>
-      <td>0.000000e+00</td>
-      <td>1.296470</td>
-      <td>linear down quadratic up</td>
-      <td>8.946560e-05</td>
-      <td>0.522953</td>
-    </tr>
-    <tr>
-      <th>1772</th>
-      <td>DLG2</td>
-      <td>healthy_CM</td>
-      <td>1055.313030</td>
-      <td>1.801893e-228</td>
-      <td>1.155496</td>
-      <td>linear down quadratic up</td>
-      <td>1.323610e-256</td>
-      <td>0.556306</td>
-    </tr>
-    <tr>
-      <th>6733</th>
-      <td>THSD4</td>
-      <td>healthy_CM</td>
-      <td>834.288239</td>
-      <td>1.583902e-180</td>
-      <td>1.671315</td>
-      <td>linear down quadratic up</td>
-      <td>6.088694e-250</td>
-      <td>0.582085</td>
-    </tr>
-    <tr>
-      <th>1276</th>
-      <td>CMYA5</td>
-      <td>healthy_CM</td>
-      <td>752.301407</td>
-      <td>9.561746e-163</td>
-      <td>1.559703</td>
-      <td>linear down quadratic up</td>
-      <td>3.774063e-66</td>
-      <td>0.527869</td>
-    </tr>
-    <tr>
-      <th>3281</th>
-      <td>LDB3</td>
-      <td>healthy_CM</td>
-      <td>542.239458</td>
-      <td>3.342198e-117</td>
-      <td>1.426196</td>
-      <td>linear down quadratic up</td>
-      <td>1.511694e-238</td>
-      <td>0.546327</td>
-    </tr>
-    <tr>
-      <th>36</th>
-      <td>ABLIM1</td>
-      <td>healthy_CM</td>
-      <td>379.423867</td>
-      <td>6.335728e-82</td>
-      <td>0.979378</td>
-      <td>linear up</td>
-      <td>3.296026e-13</td>
-      <td>0.513734</td>
-    </tr>
-    <tr>
-      <th>6903</th>
-      <td>TNNT2</td>
-      <td>healthy_CM</td>
-      <td>373.037957</td>
-      <td>1.530428e-80</td>
-      <td>1.392561</td>
-      <td>linear down quadratic up</td>
-      <td>2.329698e-118</td>
-      <td>0.535236</td>
-    </tr>
-    <tr>
-      <th>2398</th>
-      <td>FHOD3</td>
-      <td>healthy_CM</td>
-      <td>343.341326</td>
-      <td>4.125161e-74</td>
-      <td>1.731741</td>
-      <td>linear down quadratic up</td>
-      <td>0.000000e+00</td>
-      <td>0.612758</td>
-    </tr>
-    <tr>
-      <th>6663</th>
-      <td>TECRL</td>
-      <td>healthy_CM</td>
-      <td>338.848075</td>
-      <td>3.875199e-73</td>
-      <td>1.261289</td>
-      <td>linear up quadratic down</td>
-      <td>0.000000e+00</td>
-      <td>0.570571</td>
-    </tr>
-    <tr>
-      <th>4056</th>
-      <td>MYBPC3</td>
-      <td>healthy_CM</td>
-      <td>296.157297</td>
-      <td>6.751814e-64</td>
-      <td>0.686940</td>
-      <td>linear up quadratic down</td>
-      <td>0.000000e+00</td>
-      <td>0.557570</td>
-    </tr>
-    <tr>
-      <th>5652</th>
-      <td>RCAN2</td>
-      <td>healthy_CM</td>
-      <td>287.996090</td>
-      <td>3.940736e-62</td>
-      <td>1.214055</td>
-      <td>linear down quadratic up</td>
-      <td>0.000000e+00</td>
-      <td>0.566313</td>
-    </tr>
-    <tr>
-      <th>1830</th>
-      <td>DOCK3</td>
-      <td>healthy_CM</td>
-      <td>269.653643</td>
-      <td>3.667754e-58</td>
-      <td>0.534836</td>
-      <td>linear down quadratic up</td>
-      <td>2.678914e-202</td>
-      <td>0.527979</td>
-    </tr>
-    <tr>
-      <th>4177</th>
-      <td>MYOM1</td>
-      <td>healthy_CM</td>
-      <td>236.482875</td>
-      <td>5.483541e-51</td>
-      <td>1.637375</td>
-      <td>linear down</td>
-      <td>1.381677e-268</td>
-      <td>0.548281</td>
-    </tr>
-    <tr>
-      <th>1915</th>
-      <td>EFNA5</td>
-      <td>healthy_CM</td>
-      <td>236.263957</td>
-      <td>6.115035e-51</td>
-      <td>1.089847</td>
-      <td>linear down</td>
-      <td>1.127515e-164</td>
-      <td>0.532078</td>
-    </tr>
-    <tr>
-      <th>5436</th>
-      <td>PXDNL</td>
-      <td>healthy_CM</td>
-      <td>227.066418</td>
-      <td>5.957588e-49</td>
-      <td>1.284827</td>
-      <td>linear down quadratic up</td>
-      <td>1.815885e-03</td>
-      <td>0.518712</td>
-    </tr>
-  </tbody>
-</table>
-</div>
-
-
-
-<div class="alert alert-block alert-info"> 
-Plot is saved at Go folder.
-</div>
-
-
-```python
-pl.pl.go_enrichment(df, cell_type = 'healthy_CM')
-```
-
-
-    
-![png](Myocardial_infarction_files/Myocardial_infarction_29_0.png)
-    
-
-
-<div class="alert alert-block alert-info"> 
-Plots of genes are saved at 'plot_genes_for_healthy_CM' folder.
-</div>
-
-
-```python
-pl.pl.exploring_specific_genes(cluster_name = 'healthy_CM', gene_list = ['MYBPC3','MYOM1','FHOD3'])
-```
- 
-![png](Myocardial_infarction_files/Myocardial_infarction_31_1.png)
 
 ##### Group genes by pattern:
 
@@ -720,12 +221,15 @@ body {font-family: Arial;}
   border: 1px solid #ccc;
   border-top: none;
 }
+.tabcontent.active {
+    display: block;
+}
 </style>
 
 In each tab below, you can check the information for each cluster.
 
 <div class="tab button.active">
-    <button class="tablinks" onclick="openCity(event, 'cluster11')">Cluster 1</button>
+    <button class="tablinks active" onclick="openCity(event, 'cluster11')">Cluster 1</button>
     <button class="tablinks" onclick="openCity(event, 'cluster12')">Cluster 2</button>
     <button class="tablinks" onclick="openCity(event, 'cluster13')">Cluster 3</button>
     <button class="tablinks" onclick="openCity(event, 'cluster14')">Cluster 4</button>
@@ -733,40 +237,34 @@ In each tab below, you can check the information for each cluster.
     <button class="tablinks" onclick="openCity(event, 'cluster16')">Cluster 6</button>
 </div>
 
-<div id="cluster11" class="tabcontent">
+<div id="cluster11" class="tabcontent active">
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster11.png" >
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster12.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster13.png" >
 </div>
 
 <div id="cluster12" class="tabcontent">
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster21.png" >
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster22.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster23.png" >
 </div>
 
 <div id="cluster13" class="tabcontent">
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster31.png" >
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster32.png" >
-    No GO information for cluster 3!
 </div>
 
 <div id="cluster14" class="tabcontent">
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster41.png" >
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster42.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster43.png" >
 </div>
 
 <div id="cluster15" class="tabcontent">
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster51.png" >
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster52.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster53.png" >
 </div>
 
 <div id="cluster16" class="tabcontent">
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster61.png" >
     <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster62.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/healthy_CM_cluster63.png" >
 </div>
 
 
@@ -804,72 +302,62 @@ In the table, you can check the curves activities of some genes of the healthy_C
 
 The complete table can be found here: [healthy_CM_curves_activities.csv](https://costalab.ukaachen.de/open_data/PILOT/healthy_CM_curves_activities.csv)
 
+##### Cluster Specific Marker Changes:
+<div class="alert alert-block alert-info"> 
+The previous test only finds genes with significant changes over time for a given cell type. However, it does not consider if a similar pattern and expression values are found in other clusters. To further select genes, we use a Wald test that compares the fit of the gene in the cluster vs. the fit of the gene in other clusters.
+In the code below, we consider top genes (regarding the regression fit) for two interesting cell types discussed in the manuscript (‘healthy CM’ and ‘Myofib’).
+</div>
+
 
 ```python
-pl.pl.genes_selection_analysis(adata, 'Myofib', scaler_value = 0.5)
+pl.tl.gene_cluster_differentiation(adata,cellnames = ['healthy_CM','Myofib'], number_genes = 70)
 ```
 
-![png](Myocardial_infarction_files/Myofib_heatmap.png)
+    
+
+
+
+<div class="alert alert-block alert-info"> 
+Test results are saved in ‘gene_clusters_stats_extend.csv’. To find a final list of genes, we only consider genes with a fold change higher than 0.5, i.e. genes which expression is increased in the cluster at hand; and we sort the genes based on the Wald test p-value. These can be seen bellow.
+</div>
 
-Here, we utilize the [Enrichr](https://maayanlab.cloud/Enrichr/) tools to get the hallmarks of the clustered genes. The default dataset is MSigDB_Hallmark_2020, which you can change using the `gene_set_library` parameter.
 
 ```python
-pl.pl.plot_hallmark_genes_clusters(adata, 'Myofib', 'MSigDB_Hallmark_2020')
+df = pl.tl.results_gene_cluster_differentiation(cluster_name = 'healthy_CM',).head(50)
+df.head(15)
 ```
+| gene     | cluster   | waldStat         | pvalue        | FC               | Expression pattern       | fit-pvalue      | fit-mod-rsquared |
+|----------|-----------|------------------|---------------|------------------|--------------------------|-----------------|------------------|
+| SORBS1   | healthy_CM| 1574.665604      | 0.000000e+00  | 1.296470         | linear down quadratic up | 8.946560e-05    | 0.522953         |
+| DLG2     | healthy_CM| 1055.313030      | 1.801893e-228 | 1.155496         | linear down quadratic up | 1.323610e-256   | 0.556306         |
+| MYOM1    | healthy_CM| 236.482875       | 5.483541e-51  | 1.637375         | linear down              | 1.381677e-268   | 0.548281         |
+| FHOD3    | healthy_CM| 343.341326       | 4.125161e-74  | 1.731741         | linear down quadratic up | 0.000000e+00    | 0.612758         |
+| MYBPC3   | healthy_CM| 296.157297       | 6.751814e-64  | 0.686940         | linear up quadratic down | 0.000000e+00    | 0.557570         |
+| ...      | ...       | ...              | ...           | ...              | ...                      | ...             | ...              |
 
-![png](Myocardial_infarction_files/Myofib_hallmark.png)
-
-In each tab below, you can check the information for each cluster.
 
-<div class="tab button.active">
-    <button class="tablinks" onclick="openCity(event, 'cluster21')">Cluster 1</button>
-    <button class="tablinks" onclick="openCity(event, 'cluster22')">Cluster 2</button>
-    <button class="tablinks" onclick="openCity(event, 'cluster23')">Cluster 3</button>
-    <button class="tablinks" onclick="openCity(event, 'cluster24')">Cluster 4</button>
+<div class="alert alert-block alert-info"> 
+Here is the GO enrichment for  the 50 first top genes of healthy_CM (FC >= 0.5 and p-value < 0.01). Plot is saved at Go folder.
 </div>
 
-<div id="cluster21" class="tabcontent">
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster11.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster12.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster13.png" >
-</div>
 
-<div id="cluster22" class="tabcontent">
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster21.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster22.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster23.png" >
-</div>
+```python
+pl.pl.go_enrichment(df, cell_type = 'healthy_CM')
+```
+![png](Myocardial_infarction_files/Myocardial_infarction_29_0.png)
+    
 
-<div id="cluster23" class="tabcontent">
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster31.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster32.png" >
-    No GO information for cluster 3!
-</div>
 
-<div id="cluster24" class="tabcontent">
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster41.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster42.png" >
-    <img src="https://costalab.ukaachen.de/open_data/PILOT/images/Myofib_cluster43.png" >
+<div class="alert alert-block alert-info"> 
+We can visualize specific genes, for example the ones discussed in PILOT manuscript (MYBPC3,MYOM1, and FHOD3). In the plot, the orange line indicates the fit in the target cell type (shown as orange lines) compared to other cell types (represented by grey lines). Plots of genes are saved at 'plot_genes_for_healthy_CM' folder.
 </div>
 
-</br>
 
-In the table, you can check the curves activities of some genes of the Myofib:
-| Gene ID | Expression pattern       | adjusted P-value | R-squared | mod_rsquared_adj | Terminal_logFC | Transient_logFC | Switching_time | area  | cluster |
-|---------|--------------------------|------------------|-----------|------------------|----------------|-----------------|----------------|-------|---------|
-| LAMA2   | linear up quadratic down | 0.00             | 0.38      | 0.74             | -0.15          | 0.05            | 0.76           | 28.73 | 2       |
-| COL1A1  | linear down quadratic up | 0.00             | 0.31      | 0.68             | 0.14           | -0.39           | 0.86           | 48.08 | 1       |
-| NEGR1   | quadratic down           | 0.00             | 0.28      | 0.67             | -0.17          | 0               | 0.64           | 16.17 | 2       |
-| COL3A1  | linear down quadratic up | 0.00             | 0.28      | 0.67             | 0.12           | -0.63           | 0.89           | 54.59 | 1       |
-| ZBTB20  | linear up quadratic down | 0.00             | 0.27      | 0.68             | -0.15          | 0.08            | 0.77           | 30.9  | 2       |
-| ...     | ...                      | ...              | ...       | ...              | ...            | ...             | ...            |       |         |
-| MAN2A1  | quadratic down           | 0.00             | -0.35     | 0.40             | -0.17          | 0               | 0.65           | 17.56 | 2       |
-| ADGRD1  | linear down              | 0.00             | -0.36     | 0.42             | -0.17          | 0               | 0.49           | 1.45  | 4       |
-| ATR     | quadratic down           | 0.00             | -0.36     | 0.39             | -0.17          | 0               | 0.65           | 17.71 | 2       |
-| URI1    | quadratic down           | 0.00             | -0.36     | 0.39             | -0.17          | 0               | 0.65           | 17.49 | 2       |
-| TRA2A   | quadratic down           | 0.01             | -0.39     | 0.38             | -0.17          | 0               | 0.65           | 17.01 | 2       |
-
-The complete table can be found here: [Myofib_curves_activities.csv](https://costalab.ukaachen.de/open_data/PILOT/Myofib_curves_activities.csv)
+```python
+pl.pl.exploring_specific_genes(cluster_name = 'healthy_CM', gene_list = ['MYBPC3','MYOM1','FHOD3'])
+```    
+![png](Myocardial_infarction_files/Myocardial_infarction_31_1.png)
+
 
 ###### Plot specific genes:
 <div class="alert alert-block alert-info"> 
diff --git a/docs/source/Myocardial_infarction_files/Myocardial_infarction_31_1.png b/docs/source/Myocardial_infarction_files/Myocardial_infarction_31_1.png
index 6bf45a7..ea8e6b9 100644
Binary files a/docs/source/Myocardial_infarction_files/Myocardial_infarction_31_1.png and b/docs/source/Myocardial_infarction_files/Myocardial_infarction_31_1.png differ
diff --git a/docs/source/Myocardial_infarction_files/healthy_CM_hallmark.png b/docs/source/Myocardial_infarction_files/healthy_CM_hallmark.png
index 195b0af..b073aad 100644
Binary files a/docs/source/Myocardial_infarction_files/healthy_CM_hallmark.png and b/docs/source/Myocardial_infarction_files/healthy_CM_hallmark.png differ
diff --git a/docs/source/index.md b/docs/source/index.md
index 843c6ef..4c11a8c 100644
--- a/docs/source/index.md
+++ b/docs/source/index.md
@@ -63,7 +63,7 @@ Myocardial_infarction
 ```{toctree}
 ---
 maxdepth: 2
-caption: Pathomics data Analysis
+caption: Multimodal Integration
 ---
 Combination_Kidney_IgAN
 ```
diff --git a/pilotpy/plot/__init__.py b/pilotpy/plot/__init__.py
index 30f49bc..8c3f41a 100644
--- a/pilotpy/plot/__init__.py
+++ b/pilotpy/plot/__init__.py
@@ -1,5 +1,5 @@
 from .ploting import *
 from .curve_activity import *
 from .gene_selection_analysis import *
-from .pseudobulk_DE_analysis import *
+
 
diff --git a/pilotpy/plot/pseudobulk_DE_analysis.py b/pilotpy/plot/pseudobulk_DE_analysis.py
deleted file mode 100644
index 1846af5..0000000
--- a/pilotpy/plot/pseudobulk_DE_analysis.py
+++ /dev/null
@@ -1,666 +0,0 @@
-#!/usr/bin/env python3
-# -*- coding: utf-8 -*-
-"""
-Created on Mon Apr 22 14:43:35 2024
-
-@author: Mina Shaigan
-"""
-
-import os
-import pandas as pd
-import numpy as np
-import anndata as ad
-import itertools
-from sklearn.feature_selection import VarianceThreshold
-from sklearn.decomposition import PCA
-from matplotlib.lines import Line2D
-
-from adjustText import adjust_text
-from gprofiler import GProfiler
-import textwrap as tw
-import seaborn as sns
-import matplotlib.pyplot as plt
-
-import rpy2.robjects as robjects
-import rpy2.robjects.numpy2ri
-from rpy2.robjects import pandas2ri
-
-from rpy2.rinterface_lib.callbacks import logger as rpy2_logger
-import logging
-rpy2_logger.setLevel(logging.ERROR)
-pandas2ri.activate()
-
-
-def plot_cell_numbers(adata, proportion_df,
-                      cell_type: str = None,
-                      cluster_col: str = "Predicted_Labels",
-                      celltype_col: str = "cell_types",
-                      sample_col: str = "sampleID",
-                      my_pal = None):
-    """
-    
-
-    Parameters
-    ----------
-    adata : TYPE
-        DESCRIPTION.
-    proportion_df : TYPE
-        DESCRIPTION.
-    cell_type : str, optional
-        DESCRIPTION. The default is None.
-    cluster_col : str, optional
-        DESCRIPTION. The default is "Predicted_Labels".
-    celltype_col : str, optional
-        DESCRIPTION. The default is "cell_types".
-    sample_col : str, optional
-        DESCRIPTION. The default is "sampleID".
-    my_pal : TYPE, optional
-        DESCRIPTION. The default is None.
-
-    Returns
-    -------
-    None.
-
-    """
-    
-    copy_cells = adata.obs.copy()
-    copy_cells= copy_cells[copy_cells[celltype_col] == cell_type]
-    copy_cells['group'] = proportion_df.loc[copy_cells[sample_col]][cluster_col].values
-    data = copy_cells.groupby(['group', sample_col])[sample_col].count()
-    data = pd.DataFrame(data)
-    data = data.loc[~(data==0).all(axis=1)]
-
-    n_groups = np.unique(data.index.get_level_values("group").values)
-    if my_pal is None:
-        if len(n_groups) == 3:
-            my_pal = dict(zip(n_groups, ["tab:red", "skyblue", "tab:blue"]))
-        else:
-            my_pal = dict(zip(n_groups, sns.color_palette("tab10", len(n_groups))))
-
-    plt.figure(figsize=(20,5))
-    x_values = data.index.get_level_values(sample_col).values
-    plt.bar(range(data.shape[0]),data[sample_col].values,
-                      color = [my_pal[key] for key in data.index.get_level_values("group").values],
-                      tick_label = x_values)
-    plt.xticks(fontsize=24, rotation = 45, ha= 'center')
-    plt.yticks(fontsize=24)
-
-    plt.title(cell_type, fontsize = 24)
-    plt.ylabel('Number of cells', fontsize = 24)    
-    colors = my_pal     
-    labels = list(colors.keys())
-    handles = [plt.Rectangle((0,0),1,1, color=colors[label]) for label in labels]
-    plt.legend(handles, labels, fontsize = 24)
-    plt.show()
-    
-def compute_pseudobulk_DE(
-        cluster_counts: pd.DataFrame = None,
-        cluster_metadata: pd.DataFrame = None,
-        group1: str = None,
-        group2: str = None,
-        cluster_col: str = None):
-    
-    """
-    Parameters
-    ----------
-    aggr_counts : pd.DataFrame, optional
-        DESCRIPTION. The default is None.
-    metadata : pd.DataFrame, optional
-        DESCRIPTION. The default is None.
-    cell_type : str, optional
-        DESCRIPTION. The default is None.
-    group1 : str, optional
-        DESCRIPTION. The default is None.
-    group2 : str, optional
-        DESCRIPTION. The default is None.
-    n_cpus : int, optional
-        DESCRIPTION. The default is 8.
-
-    Returns
-    -------
-    my_stat_res : TYPE
-        DESCRIPTION.
-
-    """
-
-    # consider DE between two group of interest
-    my_cluster_metadata = cluster_metadata[ (cluster_metadata[cluster_col] == group1 ) | (cluster_metadata[cluster_col] == group2)]
-    my_cluster_counts = cluster_counts.loc[my_cluster_metadata.index]
-
-    R = robjects.r
-    R('library(SingleCellExperiment)')
-    R('library(DESeq2)')
-    R('library(apeglm)')
-    R('library(tidyverse, verbose = FALSE)')
-    R.assign('cluster_counts', my_cluster_counts)
-    R.assign('cluster_metadata', my_cluster_metadata)
-
-    try:
-
-        R('dds <- DESeqDataSetFromMatrix(round(t(cluster_counts)), colData = cluster_metadata, design = ~ stage)')
-        R('rld <- rlog(dds, blind = TRUE)')
-        R('dds <- DESeq(dds)')
-        R(' \
-        mylist <- list(resultsNames(dds)); \
-        for(coef in resultsNames(dds)){ \
-            if(coef != "Intercept"){ \
-                print(coef); \
-                res <- results(dds, name = coef, alpha = 0.05); \
-                res <- lfcShrink(dds, coef = coef, res = res, type = "apeglm"); \
-                res_tbl <- res %>% data.frame() %>% rownames_to_column(var = "gene") %>% as_tibble() %>% arrange(padj); \
-                mylist[[coef]] <- res_tbl; \
-            } \
-        } \
-        ')
-        
-        res = R('''mylist''')
-        return res
-    except rpy2.rinterface_lib.embedded.RRuntimeError:
-        return None
-    
-def compute_pseudobulk_PCA(
-        cluster_counts: pd.DataFrame = None,
-        cluster_metadata: pd.DataFrame = None):
-    
-    """
-    Parameters
-    ----------
-    aggr_counts : pd.DataFrame, optional
-        DESCRIPTION. The default is None.
-    metadata : pd.DataFrame, optional
-        DESCRIPTION. The default is None.
-    cell_type : str, optional
-        DESCRIPTION. The default is None.
-    group1 : str, optional
-        DESCRIPTION. The default is None.
-    group2 : str, optional
-        DESCRIPTION. The default is None.
-    n_cpus : int, optional
-        DESCRIPTION. The default is 8.
-
-    Returns
-    -------
-    my_stat_res : TYPE
-        DESCRIPTION.
-
-    """
-
-    
-    # consider DE between two group of interest
-
-    R = robjects.r
-    R('library(SingleCellExperiment)')
-    R('library(DESeq2)')
-    R('library(apeglm)')
-    R('library(tidyverse, verbose = FALSE)')
-    R.assign('cluster_counts', cluster_counts)
-    R.assign('cluster_metadata', cluster_metadata)
-
-    try:
-
-        R('dds <- DESeqDataSetFromMatrix(round(t(cluster_counts)), colData = cluster_metadata, design = ~ stage)')
-        R('rld <- rlog(dds, blind = TRUE)')
-        R('dds <- DESeq(dds)')
-        rld = R('''as.data.frame(assay(rld))''')        
-        return rld
-    except rpy2.rinterface_lib.embedded.RRuntimeError:
-        return None
-    
-def plotPCA_subgroups(proportions, deseq2_counts, cell_type, my_pal, cluster_col):
-    # consider top variances features
-    selector = VarianceThreshold(0.2)
-    new_deseq2_counts = selector.fit_transform(deseq2_counts)
-    new_deseq2_counts = pd.DataFrame(new_deseq2_counts, index = deseq2_counts.index)
-    
-    # reduce dimension by PCA
-    pca = PCA(n_components=2)
-    X_pca = pca.fit_transform(new_deseq2_counts)
-    
-    # plot PCA
-    color_map = [my_pal[val] for val in proportions.loc[new_deseq2_counts.index, cluster_col]]
-    fig, ax = plt.subplots()
-    ax.scatter(X_pca[:, 0], X_pca[:, 1],
-                c = color_map,
-                cmap='viridis', edgecolor='k', s = 200)
-    plt.xlabel('PC1: ' + str(round(pca.explained_variance_ratio_[0]*100)) + "% variance", fontsize = 24)
-    plt.ylabel('PC2: ' + str(round(pca.explained_variance_ratio_[1]*100)) + "% variance", fontsize = 24)
-    plt.title('PCA ' + str(cell_type))
-    legend_elements = []
-    for k in my_pal.keys():
-        legend_elements.append(Line2D([0], [0], marker='o', color='w', label=k,
-                              markerfacecolor=my_pal[k], markersize=15))
-    ax.legend(handles=legend_elements, loc=1)
-    plt.show()
-    
-def map_color_ps(a, low_fc_thrr, high_fc_thrr, pv_thrr):
-    log2FoldChange, symbol, nlog10 = a
-    if log2FoldChange >= high_fc_thrr and nlog10 >= pv_thrr:
-        return 'very higher'
-    elif log2FoldChange <= -low_fc_thrr and nlog10 >= pv_thrr:
-        return 'very lower'
-    else:
-        return 'no'
-
-def volcano_plot_ps(data, symbol, foldchange, p_value,
-                 cell_type,
-                 feature1,
-                 feature2,
-                 low_fc_thr = 1,
-                 high_fc_thr = 1,
-                 pv_thr = 1,
-                 figsize = (20,10),
-                 output_path = None,
-                 my_pal = None,
-                 fontsize: int = 14
-                ):
-    """
-    
-
-    Parameters
-    ----------
-    data : TYPE
-        DESCRIPTION.
-    symbol : TYPE
-        DESCRIPTION.
-    foldchange : TYPE
-        DESCRIPTION.
-    p_value : TYPE
-        DESCRIPTION.
-    cell_type : TYPE
-        DESCRIPTION.
-    feature1 : TYPE
-        DESCRIPTION.
-    feature2 : TYPE
-        DESCRIPTION.
-    low_fc_thr : TYPE, optional
-        DESCRIPTION. The default is 1.
-    high_fc_thr : TYPE, optional
-        DESCRIPTION. The default is 1.
-    pv_thr : TYPE, optional
-        DESCRIPTION. The default is 1.
-    figsize : TYPE, optional
-        DESCRIPTION. The default is (20,10).
-    output_path : TYPE, optional
-        DESCRIPTION. The default is None.
-    my_pal : TYPE, optional
-        DESCRIPTION. The default is None.
-    fontsize : int, optional
-        DESCRIPTION. The default is 14.
-
-    Returns
-    -------
-    str
-        DESCRIPTION.
-
-    """
-    
-    df = pd.DataFrame(columns=['log2FoldChange', 'nlog10', 'symbol'])
-    df['log2FoldChange'] = data[foldchange]
-    df['nlog10'] = -np.log10(data[p_value].values)
-    df['symbol'] = data[symbol].values
-    
-    color1 = my_pal[feature1]
-    color2 = my_pal[feature2]    
-    
-    df.replace([np.inf, -np.inf], np.nan, inplace=True)
-    df.dropna(subset=["nlog10"], how="all", inplace=True)
-    
-
-    selected_labels = df.loc[ (df.log2FoldChange <= low_fc_thr) & (df.log2FoldChange >= high_fc_thr) & \
-                             (df['nlog10'] >= pv_thr)]['symbol'].values
-    
-    def map_shape(symbol):
-        if symbol in selected_labels:
-            return 'important'
-        return 'not'
-    
-    df['color'] = df[['log2FoldChange', 'symbol', 'nlog10']].apply(map_color_ps, low_fc_thrr = low_fc_thr, 
-                                                                   high_fc_thrr = high_fc_thr,
-                                                                   pv_thrr = pv_thr, axis = 1)
-    df['shape'] = df.symbol.map(map_shape)
-    df['baseMean'] = df.nlog10*10
-
-    
-    plt.figure(figsize = figsize, frameon=False, dpi=100)
-    plt.style.use('default')
-
-    ax = sns.scatterplot(data = df, x = 'log2FoldChange', y = 'nlog10', 
-                         hue = 'color', hue_order = ['no', 'very higher', 'very lower'],
-                         palette = ['lightgrey', color2, color1],
-                         style = 'shape', style_order = ['not', 'important'],
-                         markers = ['o', 'o'], 
-                         size = 'baseMean', sizes = (40, 400)
-                        )
-
-    ax.axhline(pv_thr, zorder = 0, c = 'k', lw = 2, ls = '--')
-    ax.axvline(high_fc_thr, zorder = 0, c = 'k', lw = 2, ls = '--')
-    ax.axvline(-low_fc_thr, zorder = 0, c = 'k', lw = 2, ls = '--')
-
-    texts = []
-    for i in range(len(df)):
-        if df.iloc[i].nlog10 >= pv_thr and (df.iloc[i].log2FoldChange >= high_fc_thr):
-            texts.append(plt.text(x = df.iloc[i].log2FoldChange, y = df.iloc[i].nlog10, s = df.iloc[i].symbol,
-                                 fontsize = fontsize, weight = 'bold', family = 'sans-serif'))
-        if df.iloc[i].nlog10 >= pv_thr and ( df.iloc[i].log2FoldChange <= -low_fc_thr):
-            texts.append(plt.text(x = df.iloc[i].log2FoldChange, y = df.iloc[i].nlog10, s = df.iloc[i].symbol,
-                                 fontsize = fontsize + 2, weight = 'bold', family = 'sans-serif'))
-    adjust_text(texts)
-
-    custom_lines = [Line2D([0], [0], marker='o', color='w', markerfacecolor=color2, markersize=fontsize),
-                   Line2D([0], [0], marker='o', color='w', markerfacecolor=color1, markersize=fontsize)]
-
-    plt.legend(custom_lines, ['Higher expressions in ' + feature2, 'Higher expressions in ' + feature1], loc = 1,
-               bbox_to_anchor = (1,1.1), frameon = False, prop = {'weight': 'normal', 'size': fontsize})
-
-    for axis in ['bottom', 'left']:
-        ax.spines[axis].set_linewidth(2)
-
-    ax.spines['top'].set_visible(False)
-    ax.spines['right'].set_visible(False)
-
-    ax.tick_params(width = 2)
-    ax.set_ylim(bottom=0)
-    plt.title("Expression Score \n " + feature1 + " - " + feature2, fontsize = fontsize + 4)
-    plt.xticks(size = fontsize, weight = 'bold')
-    plt.yticks(size = fontsize, weight = 'bold')
-
-    plt.xlabel("$log_{2}$ (Fold Change)", size = fontsize + 2)
-    plt.ylabel("-$log_{10}$ (P-value)", size = fontsize + 2)
-
-    if output_path is not None:
-        plt.savefig(output_path + "/volcano_" + str(feature1) + "-" + str(feature2) + "_FC.pdf",
-                    dpi = 100, bbox_inches = 'tight', facecolor = 'white')
-    plt.show()
-    
-def gene_annotation_cell_type_subgroup(data: pd.DataFrame = None,
-                                       symbol: str = 'gene',
-                                       sig_col: str = 'significant_gene',
-                                       cell_type: str = None,
-                                       group: str = None,
-                                       sources: str = None,
-                                       num_gos: int = 10,
-                                       fig_h: int = 6,
-                                       fig_w: int = 4,
-                                       font_size: int = 14,
-                                       max_length:int = 50,
-                                       path_to_results: str = None,
-                                       my_pal = None
-                                     ):
-    """
-    Plot to show the most relative GO terms for specifc cell-type of determind patient sub-group
-
-    Parameters
-    ----------
-    data : pd.DataFrame
-        DESCRIPTION. The default is None.
-    symbol : str, optional
-        DESCRIPTION. The default is 'gene'.
-    sig_col : str, optional
-        DESCRIPTION. The default is 'significant_gene'.
-    cell_type : str
-        DESCRIPTION. The default is None.
-    group : str
-        DESCRIPTION. The default is None.
-    sources : str, optional
-        DESCRIPTION. The default is None.
-    num_gos : int, optional
-        DESCRIPTION. The default is 10.
-    fig_h : int, optional
-        DESCRIPTION. The default is 6.
-    fig_w : int, optional
-        DESCRIPTION. The default is 4.
-    font_size : int, optional
-        DESCRIPTION. The default is 14.
-    max_length : int, optional
-        DESCRIPTION. The default is 50.
-    path_to_results : str, optional
-        DESCRIPTION. The default is None.
-    my_pal : TYPE, optional
-        DESCRIPTION. The default is None.
-
-    Returns
-    -------
-    None.
-
-    """
-
-#     path_to_results = 'Results_PILOT'
-
-    color = my_pal[group]
-
-#     group_genes = pd.read_csv(path_to_results + \
-#                               "/significant_genes_" + cell_type + "_" + group + ".csv")
-
-    group_genes = data.loc[data[sig_col] == group, symbol].values
-    gp = GProfiler(return_dataframe = True)
-    if list(group_genes):
-        gprofiler_results = gp.profile(organism = 'hsapiens',
-                                       query = list(group_genes),
-                                       no_evidences = False,
-                                       sources = sources)
-    else:
-        return "Genes list is empty!"
-    
-    if(gprofiler_results.shape[0] == 0):
-        return "Not enough information!"
-    elif(gprofiler_results.shape[0] < num_gos):
-        num_gos = gprofiler_results.shape[0]
-
-    all_gprofiler_results = gprofiler_results.copy()
-    # display(all_gprofiler_results.head())
-       
-    # print(len(list(group_genes['symbol'].values)))
-    # selected_gps = gprofiler_results.loc[0:num_gos,['name', 'p_value']]
-    selected_gps = gprofiler_results.head(num_gos)[['name', 'p_value']]
-    
-    selected_gps['nlog10'] = -np.log10(selected_gps['p_value'].values)
-
-    for i in selected_gps.index:
-        split_name = "\n".join(tw.wrap(selected_gps.loc[i, 'name'], max_length))
-        selected_gps.loc[i, 'name'] = split_name
-    
-    figsize = (fig_h, fig_w)
-
-    plt.figure(figsize = figsize, dpi = 100)
-    plt.style.use('default')
-    sns.scatterplot(data = selected_gps, x = "nlog10", y = "name", s = 300, color = color)
-
-    plt.title('GO enrichment in ' + cell_type + ' associated with ' + group + \
-              '\n (number of genes: ' + str(len(list(group_genes))) + ")", fontsize = font_size + 2)
-
-    plt.xticks(size = font_size)
-    plt.yticks(size = font_size)
-
-    plt.ylabel("GO Terms", size = font_size)
-    plt.xlabel("-$log_{10}$ (P-value)", size = font_size)
-    
-    save_path = path_to_results + '/'
-    if not os.path.exists(save_path):
-            os.makedirs(save_path)
-#     plt.savefig(save_path + group + ".pdf", bbox_inches = 'tight',
-#                 facecolor = 'white', transparent = False)
-    plt.show()
-    
-    all_gprofiler_results.to_csv(save_path + group + ".csv")
-    
-def get_sig_genes(data, symbol, foldchange, p_value, cell_type,
-                 feature1, feature2,
-                 low_fc_thr = 1, high_fc_thr = 1, pv_thr = 1):
-    df = pd.DataFrame(columns=['log2FoldChange', 'nlog10', 'symbol'])
-    df['log2FoldChange'] = data[foldchange]
-    df['nlog10'] = -np.log10(data[p_value].values)
-    df['symbol'] = data[symbol].values
-    
-    df.replace([np.inf, -np.inf], np.nan, inplace = True)
-    df.dropna(subset = ["nlog10"], how = "all", inplace = True)
-    
-    data['significant_gene'] = ""
-    group1_selected_labels = df.loc[ (df.log2FoldChange <= -low_fc_thr) & (df['nlog10'] >= pv_thr), 'symbol'].values
-    data.loc[data[symbol].isin(group1_selected_labels), 'significant_gene'] = feature1
-    
-    group2_selected_labels = df.loc[ (df.log2FoldChange >= high_fc_thr) & (df['nlog10'] >= pv_thr), 'symbol'].values
-    data.loc[data[symbol].isin(group2_selected_labels), 'significant_gene'] = feature2
-
-    return data
-
-def get_pseudobulk_DE(adata: ad.AnnData,
-                      proportion_df: pd.DataFrame,
-                      cell_type: str,
-                      fc_thr: list,
-                      pv_thr: float = 0.05,
-                      celltype_col: str = "cell_types",
-                      sample_col: str = "sampleID",
-                      cluster_col: str = "Predicted_Labels",
-                      remove_samples: list = [],
-                      my_pal: dict = None,
-                      path_to_results: str = 'Results_PILOT/',
-                      figsize: tuple = (30, 15),
-                      num_gos: int = 10,
-                      fig_h: int = 6,
-                      fig_w: int = 4,
-                      sources: list = ['GO:CC', 'GO:PB', 'GO:MF'],
-                      fontsize: int = 14,
-                      load: bool = False
-                     ):
-    """
-    
-
-    Parameters
-    ----------
-    adata : ad.AnnData
-        DESCRIPTION.
-    proportion_df : pd.DataFrame
-        DESCRIPTION.
-    cell_type : str
-        DESCRIPTION.
-    fc_thr : list
-        DESCRIPTION.
-    pv_thr : float, optional
-        DESCRIPTION. The default is 0.05.
-    celltype_col : str, optional
-        DESCRIPTION. The default is "cell_types".
-    sample_col : str, optional
-        DESCRIPTION. The default is "sampleID".
-    cluster_col : str, optional
-        DESCRIPTION. The default is "Predicted_Labels".
-    remove_samples : list, optional
-        DESCRIPTION. The default is [].
-    my_pal : dict, optional
-        DESCRIPTION. The default is None.
-    path_to_results : str, optional
-        DESCRIPTION. The default is 'Results_PILOT/'.
-    figsize : tuple, optional
-        DESCRIPTION. The default is (30, 15).
-    num_gos : int, optional
-        DESCRIPTION. The default is 10.
-    fig_h : int, optional
-        DESCRIPTION. The default is 6.
-    fig_w : int, optional
-        DESCRIPTION. The default is 4.
-    sources : list, optional
-        DESCRIPTION. The default is ['GO:CC', 'GO:PB', 'GO:MF'].
-    fontsize : int, optional
-        DESCRIPTION. The default is 14.
-    load : bool, optional
-        DESCRIPTION. The default is False.
-
-    Returns
-    -------
-    None.
-
-    """
-    
-
-    n_clusters = np.unique(proportion_df[cluster_col])
-    if my_pal is None:
-        if len(n_clusters) == 3:
-            my_pal = dict(zip(n_clusters, ["tab:red", "skyblue", "tab:blue"]))
-        else:
-            my_pal = dict(zip(n_clusters, sns.color_palette("tab10", len(n_clusters))))
-
-    save_path = path_to_results + "/Diff_Expressions_Results/" + str(cell_type) + "/pseudobulk/"
-    log_pv_thr = -np.log10(pv_thr)
-
-    print("Plot cells frequency for each sample... ")
-    plot_cell_numbers(adata, proportion_df, cell_type = cell_type,
-                  cluster_col = cluster_col, celltype_col = celltype_col,
-                      sample_col = sample_col, my_pal= my_pal)
-    
-    if load == False:
-        print("Aggregating the counts and metadata to the sample level...")
-        counts_df = adata.to_df()
-        counts_df[[celltype_col, sample_col]] = adata.obs[[celltype_col, sample_col]].values
-        
-        aggr_counts = counts_df.groupby([celltype_col, sample_col]).sum()
-    
-    
-        cluster_counts = aggr_counts.loc[cell_type]
-        cluster_metadata = proportion_df.loc[cluster_counts.index.values]
-        cluster_metadata['stage'] = cluster_metadata[cluster_col].values
-    
-        # remove unwanted samples
-        if not (remove_samples is None):
-            for sample in remove_samples:
-                if sample in cluster_metadata.index:
-                    cluster_metadata = cluster_metadata.drop(index = sample)
-                if sample in cluster_counts.index:
-                    cluster_counts = cluster_counts.drop(index = sample)
-    
-        cluster_metadata = cluster_metadata.loc[cluster_counts.index]
-        cluster_counts = cluster_counts.loc[:, (cluster_counts != 0).any(axis=0)]
-    
-        print("Use the median of ratios method for count normalization from DESeq2")
-        print("Use regularized log transform (rlog) of the normalized counts from DESeq2")
-        rld = compute_pseudobulk_PCA(cluster_counts, cluster_metadata)
-    
-        if rld is not None:
-            
-            if not os.path.exists(save_path):
-                os.makedirs(save_path)
-            rld.to_csv(save_path + "rld_PCA.csv")
-    else:
-        rld = pd.read_csv(save_path + "rld_PCA.csv", index_col = 0)
-        
-    deseq2_counts = rld.transpose()
-    print("Plot the first two principal components... ")
-    plotPCA_subgroups(proportion_df, deseq2_counts, cell_type, my_pal, cluster_col)
-
-    print("Performing the DE analysis... ")
-    j = 0
-    for groups in itertools.combinations(n_clusters, 2):
-        data = None
-        if load == False:
-            res = compute_pseudobulk_DE(cluster_counts, cluster_metadata,
-                                        group1 = groups[0],
-                                        group2 = groups[1],
-                                        cluster_col = cluster_col)
-            if res is not None:
-                with (robjects.default_converter + pandas2ri.converter).context():
-                    data = robjects.conversion.get_conversion().rpy2py(res[1])
-
-                data = get_sig_genes(data, 'gene', 'log2FoldChange', 'padj', cell_type, 
-                                     groups[0], groups[1], fc_thr[j], fc_thr[j], log_pv_thr)
-                
-                data.to_csv(save_path + "/" + str(groups[1]) + "vs" + str(groups[0]) + "_DE.csv")
-        else:
-            data = pd.read_csv(save_path + "/" + str(groups[1]) + "vs" + str(groups[0]) + "_DE.csv", index_col = 0)
-
-        if data is not None:
-            print("Plot volcano plot for " + str(groups[1]) + " vs " + str(groups[0]))
-            volcano_plot_ps(data, 'gene', 'log2FoldChange', 'padj', cell_type, 
-                         groups[0], groups[1], fc_thr[j], fc_thr[j], log_pv_thr, figsize = figsize,
-                         output_path = save_path + "/",
-                         my_pal = my_pal, fontsize = fontsize)
-
-            print("Plot GO analysis for " + str(groups[1]) + " vs " + str(groups[0]))
-            gene_annotation_cell_type_subgroup(data, cell_type = cell_type, group = groups[0],
-                                               sources = sources, num_gos = num_gos,
-                                               fig_h = fig_h, fig_w = fig_w, font_size = fontsize,
-                                               path_to_results = save_path + "/" + str(groups[1]) + "vs" + str(groups[0]) + "/GOs/",
-                                               my_pal = my_pal)
-            gene_annotation_cell_type_subgroup(data, cell_type = cell_type, group = groups[1],
-                                               sources = sources, num_gos = num_gos,
-                                               fig_h = fig_h, fig_w = fig_w, font_size = fontsize,
-                                               path_to_results = save_path + "/" + str(groups[1]) + "vs" + str(groups[0]) + "/GOs/",
-                                               my_pal = my_pal)
-        j += 1
\ No newline at end of file
diff --git a/pilotpy/tools/patients_sub_clustering.py b/pilotpy/tools/patients_sub_clustering.py
index 648d3bc..a3a8710 100644
--- a/pilotpy/tools/patients_sub_clustering.py
+++ b/pilotpy/tools/patients_sub_clustering.py
@@ -181,7 +181,7 @@ def compute_diff_expressions(adata,cell_type: str = None,
          cells=adata.uns[cell_type] 
     
     
-    
+    """
     import rpy2.robjects as robjects
     import rpy2.robjects.numpy2ri
     from rpy2.robjects import pandas2ri
@@ -239,7 +239,7 @@ def compute_diff_expressions(adata,cell_type: str = None,
     
 
 
-    
+    """
 def install_r_packages():
     """
     Install R packages using rpy2.
@@ -253,16 +253,6 @@ def install_r_packages():
         None
     """
     # Install R packages using rpy2
-    import rpy2.robjects as robjects
-
-    robjects.r('''
-    if (!requireNamespace("BiocManager", quietly = TRUE))
-        install.packages("BiocManager")
-    ''')
-
-    robjects.r('''
-    BiocManager::install("limma")
-    ''')
-   
+    print('Install rpy2 with conda')
 
    
diff --git a/setup.py b/setup.py
index 02f26d3..a127130 100644
--- a/setup.py
+++ b/setup.py
@@ -28,9 +28,7 @@
             "elpigraph-python>=0.3.1,<0.4.0",
             "adjusttext>=0.8,<0.9",
             "gprofiler-official>=1.0.0,<1.1.0",
-            "rpy2>=3.5.11",
-            
-            
+   
             
         ],
         packages=find_packages()

	gene	cluster	waldStat	pvalue	FC	Expression pattern	fit-pvalue	fit-mod-rsquared
2642	GAS7	Myofib	212.477292	8.487275e-46	1.086644	linear up quadratic down	1.873033e-107	0.570704
2151	EXT1	Myofib	125.383128	5.344198e-27	0.786136	linear up quadratic down	3.159831e-35	0.555757
4979	PKNOX2	Myofib	89.738712	2.492742e-19	0.855504	quadratic down	1.039404e-117	0.544122
2529	FN1	Myofib	70.641696	3.110595e-15	1.573680	linear down quadratic up	2.947389e-188	0.633774
1437	COL6A3	Myofib	54.751169	7.758841e-12	1.069156	linear down quadratic up	3.514298e-172	0.608543
5775	RORA	Myofib	52.486295	2.359167e-11	0.899459	quadratic down	7.232834e-174	0.587234
2832	GXYLT2	Myofib	24.247113	2.218154e-05	2.000205	linear up quadratic down	2.402171e-85	0.537920
3783	MGP	Myofib	23.244418	3.591226e-05	0.871041	quadratic down	1.327779e-225	0.571374
4726	PCDH9	Myofib	20.439646	1.376052e-04	0.604830	linear down	0.000000e+00	0.596035
1231	CHD9	Myofib	20.389564	1.409364e-04	0.527488	linear up quadratic down	7.658862e-77	0.559604
1710	DCN	Myofib	19.656307	1.999818e-04	1.033697	linear up quadratic down	1.866152e-284	0.588602
2824	GSN	Myofib	18.015612	4.366007e-04	0.638136	linear up quadratic down	2.942472e-279	0.601684
1392	COL3A1	Myofib	17.276479	6.199787e-04	1.240454	linear down quadratic up	0.000000e+00	0.665616
1372	COL1A2	Myofib	14.068816	2.812963e-03	1.327753	linear down quadratic up	0.000000e+00	0.655032
7245	VCAN	Myofib	12.610158	5.560192e-03	0.838764	linear down quadratic up	1.761922e-164	0.571981
	gene	cluster	waldStat	pvalue	FC	Expression pattern	fit-pvalue	fit-mod-rsquared
6165	SORBS1	healthy_CM	1574.665604	0.000000e+00	1.296470	linear down quadratic up	8.946560e-05	0.522953
1772	DLG2	healthy_CM	1055.313030	1.801893e-228	1.155496	linear down quadratic up	1.323610e-256	0.556306
6733	THSD4	healthy_CM	834.288239	1.583902e-180	1.671315	linear down quadratic up	6.088694e-250	0.582085
1276	CMYA5	healthy_CM	752.301407	9.561746e-163	1.559703	linear down quadratic up	3.774063e-66	0.527869
3281	LDB3	healthy_CM	542.239458	3.342198e-117	1.426196	linear down quadratic up	1.511694e-238	0.546327
36	ABLIM1	healthy_CM	379.423867	6.335728e-82	0.979378	linear up	3.296026e-13	0.513734
6903	TNNT2	healthy_CM	373.037957	1.530428e-80	1.392561	linear down quadratic up	2.329698e-118	0.535236
2398	FHOD3	healthy_CM	343.341326	4.125161e-74	1.731741	linear down quadratic up	0.000000e+00	0.612758
6663	TECRL	healthy_CM	338.848075	3.875199e-73	1.261289	linear up quadratic down	0.000000e+00	0.570571
4056	MYBPC3	healthy_CM	296.157297	6.751814e-64	0.686940	linear up quadratic down	0.000000e+00	0.557570
5652	RCAN2	healthy_CM	287.996090	3.940736e-62	1.214055	linear down quadratic up	0.000000e+00	0.566313
1830	DOCK3	healthy_CM	269.653643	3.667754e-58	0.534836	linear down quadratic up	2.678914e-202	0.527979
4177	MYOM1	healthy_CM	236.482875	5.483541e-51	1.637375	linear down	1.381677e-268	0.548281
1915	EFNA5	healthy_CM	236.263957	6.115035e-51	1.089847	linear down	1.127515e-164	0.532078
5436	PXDNL	healthy_CM	227.066418	5.957588e-49	1.284827	linear down quadratic up	1.815885e-03	0.518712