Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I by scienceangel

View this thread on steempeak.com
· @scienceangel ·
$93.42
Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I
In the previous [post](https://steemit.com/steemstem/@scienceangel/lab-diaries-4-whole-transcriptome-and-proteome-analysis-by-rna-sequencing-and-tmt-ms) I have explained how we obtained our RNA-seq and TMT-MS data, and today I will introduce you to a very useful tool for analysis of such large-scale biological data.

<center>![Picture2.png](https://steemitimages.com/DQmVXY2AwNSYcZCUp9Kvyki9oiFoBaGBgcwRuB8CxQo6rSM/Picture2.png)</center>
<center>*Screenshot of GSEA results*</center>

## Finding a meaning in your meaningless data
_________

Imagine that you've analysed your RNA seq data, and after analysis you saw that you have more than 5000 differentially expressed genes after you have induced expression of a particular gene in your cells...

**What the .... am I supposed to do with that information?!?**

<div class=pull-left>

In each of our cells there are many, many (and one more time - many!) signal transduction pathways that include thousands of proteins, which are tightly regulated to keep our cells alive and functioning. In cancer cells, due to the activation of oncogenes, many of those pathways are deregulated to provide advantages to cancer cells over normal cells. These changes make them immortal, allowing them to grow indefinitely and eventually to migrate and invade other parts of our body. 

</div>

<div class=pull-right>

<center>![800px-Signal_transduction_pathways.svg.png](https://steemitimages.com/DQmVGddMm5JHaUMo1yro3pGiXu8G78XAiKHptYgsquSgKu9/800px-Signal_transduction_pathways.svg.png)</center>
<center> <sup>[Simplified representation of major signal transduction pathways](https://en.wikipedia.org/wiki/File:Signal_transduction_pathways.svg), by cybertory, [CC BY-SA 3.0](https://creativecommons.org/licenses/by-sa/3.0/deed.en)</sup> </center>

</div>

<br><br><br><br> <br><br><br><br><br><br><br><br><br><br>

Now let's get back to our "small" problem of having 5000 over- or under-expressed genes in our cells with high expression of an oncogene, compared to same cells with low expression of the same oncogene.

How can we know in which of these signal transduction pathways are all those genes included, and what is the biological meaning of those 5000 genes being changed?

Well, we could just go and analyse one by one gene and if we are lucky, just before retirement we would succeed in analyzing them all and finally publish our results 30 years after we performed an actual experiment...

### Gene Set Enrichment Analysis (GSEA)
_________

... or we could just use **Gene Set Enrichment Analysis (GSEA)** and publish our results in only 3 years, yeah!

If we're lucky... I mean, if we have any good results actually... Never mind, just keep reading...

GSEA is a very useful computational tool for interpreting gene expression data, such as microarray, RNA seq data, etc. The advantage of GSEA over other methods is that, instead of focusing on analysis of single genes, it performs the analysis of a *group of genes*. In this way, changes in pathways reflected through small, but coordinated change of several genes can be detected, leading to potential elucidation of biologically significant changes relevant for eg. process of carcinogenesis.

#### Gene sets
______

By using GSEA, we are actually trying to put our obtained results of differential expression into the previous, already known biological context. This is achieved by using the *gene sets*, which represent group of genes that are grouped together based on their common biological function and/or involvement in the same biological pathways. Those gene sets are formed according to already published biological data containing biochemical pathways or coexpression of functionally related genes. They are publicly available in the form of Molecular Signatures Database (MSigDB) on the [Broad Institute](http://software.broadinstitute.org/gsea/msigdb/collections.jsp) web page.

#### GSEA principle
_________

>Gene Set Enrichment Analysis (GSEA) is a computational method that determines whether an a priori defined set of genes shows statistically significant, concordant differences between two biological states (e.g. phenotypes).

Definition taken from official [GSEA user guide](http://software.broadinstitute.org/gsea/doc/GSEAUserGuideFrame.html), because there's no other way to explain mathematical method behind the analysis. The algorithm used is fully described in this [paper](http://software.broadinstitute.org/gsea/doc/subramanian_tamayo_gsea_pnas.pdf), and it's based on weighted Kolmogorovโ€“Smirnov-like statistics.

Luckily for us biologists, our friends mathematicians have developed a user-friendly software, that performs analysis for us, the "only" thing we need to do is to prepare input files with our data in the form that will be recognized by the software. In the following lines I have prepared a detailed guideline on how to prepare your data and run GSEA.

#### GSEA Tutorial
__________

<div class=pull-left>

![Java.png](https://steemitimages.com/DQmXuSJh8gDtGw6S8x9FxQXWYToBgyFswnbHASQRZiZouQo/Java.png)
<center> <sup>[Java](https://commons.wikimedia.org/wiki/File:Java.png), public domain</sup> </center>
</div>

<div class=pull-right>

GSEA Software runs on Java, so before downloading and installing software, you need to make sure that you have Java 8 installed on your computer.

Very important information - GSEA Software is available is several memory configurations: **1, 2, 4** and **8 GB**. GSEA of 1 GB can be used with 32- or 64-bit Java 8, other configurations require 64-bit Java 8 only!

</div>

<br><br><br><br> <br><br><br><br><br><br><br><br><br>

If you run 64-bit operative system (Windows) on your computer, I recommend you to install 64-bit Java 8 and GSEA with higher memory configurations (2, 4 or 8 GB). Of course, you have to choose a memory configuration *smaller* than total RAM memory of your computer!

This has one very important practical implication - when you're analyzing very large data sets (eg. more than 10 000 genes) and using databases with large number of gene sets (we'll come to that later), it often happens that Java runs out of memory, and GSEA Software cannot perform the analysis. For example, my laptop has 8 GB RAM, 64-bit Windows 7 Ultimate, and I'm running 4 GB GSEA (quite enough for all the analyses I performed).

After you have installed Java 8, you can head to [GSEA Downloads](http://software.broadinstitute.org/gsea/downloads.jsp) page and download one of the GSEA Software configurations.
**Note - you will have to register to be able to access downloads page, your e-mail is required only.**

<center>![gsea download.png](https://steemitimages.com/DQmVHf8DVjoxgg5FPFdvXLK9hf9xFhJSr28bYtrPuBdEGmr/gsea%20download.png)</center>
<center>*GSEA Downloads page*</center>

**Running the software**

If you've successfully installed your GSEA Software, after launching it the main window opens, and it should look something like this:

![gsea main load data.png](https://steemitimages.com/DQmVV9KGhb5wwVRkUKZEdAnLkjmC1CbWYpxzigwwE2cCWDx/gsea%20main%20load%20data.png)

In the upper left corner click on *Load data*, and the following window opens:

![gsea browse.png](https://steemitimages.com/DQmcPtM7TyLgrHgqFyNPbkT6FfpD5HaVrHA8EV7bPk4kCoR/gsea%20browse.png)

Now we have arrived at the most important part of GSEA analysis - **transforming your data into the GSEA input files**. I'm saying this is the most important part because if you fail to create input files in proper format(s) acceptable by the GSEA, the software will report an error after you load the files and you won't be able to run your analysis.

So pay close attention! :)

If you take a closer look at the upper right part of the last image, you will see that software informs you about the acceptable data formats, Basically, it is essential that you have prepared the following two file formats before running your analysis:

**1.  Expression data set file** (there are several options, but *.gct file* works perfectly fine for me) - this is your expression data set, actual data you obtained after performing RNA seq analysis

**2. Phenotype labels file** (*.cls file*)

**Preparing .gct file**

Usually your RNA seq data is contained (after initial processing) in one or more Excel files (you can see how it looks in my previous [post](https://steemit.com/steemstem/@scienceangel/lab-diaries-4-whole-transcriptome-and-proteome-analysis-by-rna-sequencing-and-tmt-ms). To be able to analyse it in GSEA, you need to adapt it into something that GSEA Software can read and understand, and that's called *.gct file.*

In the first step of creating your .gct file, open a blank Excel sheet and from your RNA seq data file copy and paste columns containing gene names and all replicates of your data.

<center>![gct file.png](https://steemitimages.com/DQmYbenzGTSo3DxXdrtdxSTrybwo1bYgMEBsQhwGYyCRRP9/gct%20file.png)</center>
<center>*First step in creating a .gct file*</center>

As you can see in the image, it is also necessary to have an additional column in between NAME and your replicates, which is called DESCRIPTION (I usually leave this column empty).

In the first cell of the first row (number 1 in red) you need to write this **#1.2**, and this is what each .gct file must contain, it represents the version of .gct file and GSEA uses this particular version/form.

In the first cell of the second row, you must input **the number of your genes contained in .gct file, genes that you're analyzing** (number 2 in red, in my example I had 9616 genes for the analysis). This is very important step, because if the number in this cell doesn't match the number of genes in .gct file, the software will report an error and won't work.

Finally, in the second cell of the second row (number 3 in red) you need to input **the number of samples** you're analyzing. In my example, I had 6 samples - 2 triplicates.

The order of samples matter as well, meaning that samples you're focusing your research on should be put first. In my example, I was interested in comparing expression in U266M samples with U266C samples (controls).

When you have finished creating Excel .gct file and it looks like mine example from above, make sure you save it as Excel file first. Then you need to save it as **text (Tab delimited)** file as well. So now you should have two files - Excel and text file.

Final step - open your text file and select Save as, then select All files, and just manually type extension at the end - .gct.

You should now have three files in your GSEA folder: Excel, text and .gct file.

**Preparing phenotype labels (.cls) file**

Phenotype labels file serves to provide information to GSEA Software on how many samples your .gct file contains, how many *different phenotypes* and which phenotypes are located in which cells.

Let's proceed to example immediately:

<center>![cls file.png](https://steemitimages.com/DQmS72etsjVHn768x7y8Ka27uMmBy9ouMK2pFMRmnd4Qcn9/cls%20file.png)</center>
<center>*First step in creating .cls file*</center>

In the first row you can see following numbers: 6, 2 and 1.

6 represents number of samples you have.

2 represents number of different phenotypes you're analyzing. That means - tumor vs. control tissue, treated vs. non-treated cells, transformed vs. non-transformed cells. All examples of different phenotypes/states in your experiment.

1 is always 1, I actually don't know why :)

Second row (number 2 in red) tells to GSEA Software how your phenotypes are labeled, so the software can have information on what is one and what is another phenotype. This sign "#" must precede the first phenotype label (in the first cell of the second row).

Finally, third row (number 3 in red) provides information to software how many samples of each phenotype are being analyzed. In my example, 3 samples of phenotype U266M and 3 samples of phenotype U266C.

The rest of the steps are the same as in creation of .gct file - first make sure that you save your Excel file of .cls file.

Then save it as Tab delimited text file.

Finally, open the text file and change the extension to .cls. At the end you should have three files - Excel, text and .cls file.

In the Part II I will explain how to perform GSEA using files you created using this tutorial.

Until then, relax and keep steemSTEM! ;)
________

Literature

[1] [Subramanian, A., Tamayo, P., Mootha, V. K., Mukherjee, S., Ebert, B. L., Gillette, M. A., ... & Mesirov, J. P. (2005). Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proceedings of the National Academy of Sciences, 102(43), 15545-15550.](http://software.broadinstitute.org/gsea/doc/subramanian_tamayo_gsea_pnas.pdf)

[2] [GSEA User Guide](http://software.broadinstitute.org/gsea/doc/GSEAUserGuideFrame.html)

[3] [Java heap space / OutOfMemoryError](http://software.broadinstitute.org/cancer/software/gsea/wiki/index.php/Known_Issues)

--------------
<center>**For more scientific-related content check [steemSTEM](https://steemit.com/trending/steemstem). Follow me if you like my posts and want to read some more ;) If you have any thoughts/suggestions fell free to leave a comment!**</center>

<center>![scienceangel.png](https://steemitimages.com/DQmVujufXFQsvfNCohqMQ8GH8rp3EYBKUaC3TgtVApwbmNX/scienceangel.png)</center>

<center>**Special thanks to wonderful and incredibly talented @atopy, who authored this amazing artwork for me, make sure to check out her blog!!!**</center>
๐Ÿ‘  , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and 202 others
properties (23)
post_id43,731,741
authorscienceangel
permlinklab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i
categorysteemstem
json_metadata"{"links": ["https://steemit.com/steemstem/@scienceangel/lab-diaries-4-whole-transcriptome-and-proteome-analysis-by-rna-sequencing-and-tmt-ms", "https://en.wikipedia.org/wiki/File:Signal_transduction_pathways.svg", "https://creativecommons.org/licenses/by-sa/3.0/deed.en", "http://software.broadinstitute.org/gsea/msigdb/collections.jsp", "http://software.broadinstitute.org/gsea/doc/GSEAUserGuideFrame.html", "http://software.broadinstitute.org/gsea/doc/subramanian_tamayo_gsea_pnas.pdf", "https://commons.wikimedia.org/wiki/File:Java.png", "http://software.broadinstitute.org/gsea/downloads.jsp", "http://software.broadinstitute.org/cancer/software/gsea/wiki/index.php/Known_Issues", "https://steemit.com/trending/steemstem"], "users": ["atopy"], "tags": ["steemstem", "steemeducation", "science", "biology", "life"], "app": "steemit/0.1", "image": ["https://steemitimages.com/DQmVXY2AwNSYcZCUp9Kvyki9oiFoBaGBgcwRuB8CxQo6rSM/Picture2.png"], "format": "markdown"}"
created2018-04-15 16:24:54
last_update2018-04-15 16:24:54
depth0
children17
net_rshares18,320,723,437,871
last_payout2018-04-22 16:24:54
cashout_time1969-12-31 23:59:59
total_payout_value71.591 SBD
curator_payout_value21.828 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length13,487
author_reputation113,646,366,638,572
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (266)
@alexs1320 · (edited)
$0.08
This is absolutely perfect! The best HowTo that could be found on the internet. Students should learn from this.

I can't wait for the Part III or II depending how you count
๐Ÿ‘  
properties (23)
post_id43,732,596
authoralexs1320
permlinkre-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180415t163103393z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-15 16:31:03
last_update2018-04-15 16:31:24
depth1
children0
net_rshares15,894,828,981
last_payout2018-04-22 16:31:03
cashout_time1969-12-31 23:59:59
total_payout_value0.078 SBD
curator_payout_value0.000 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length173
author_reputation149,050,464,338,619
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (1)
@tking77798 ·
$0.08
I like the new artwork.   @atopy did a great job!

Also, great breakdown of GSEA.  Its a useful tool, but even with its help its difficult to interpret large datasets like this.
๐Ÿ‘  
properties (23)
post_id43,746,799
authortking77798
permlinkre-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180415t182324800z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "users": ["atopy"], "tags": ["steemstem"]}"
created2018-04-15 18:23:27
last_update2018-04-15 18:23:27
depth1
children1
net_rshares15,105,263,098
last_payout2018-04-22 18:23:27
cashout_time1969-12-31 23:59:59
total_payout_value0.057 SBD
curator_payout_value0.018 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length177
author_reputation14,491,425,587,942
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (1)
@scienceangel ·
$0.07
Exactly, it's always challenging finding the answers from these type of datasets. As a matter of fact, I'm still dealing with this :)
๐Ÿ‘  
properties (23)
post_id43,748,049
authorscienceangel
permlinkre-tking77798-re-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180415t183342163z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-15 18:33:39
last_update2018-04-15 18:33:39
depth2
children0
net_rshares14,786,769,548
last_payout2018-04-22 18:33:39
cashout_time1969-12-31 23:59:59
total_payout_value0.054 SBD
curator_payout_value0.018 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length133
author_reputation113,646,366,638,572
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (1)
@vanessahampton ·
>In each of our cells there are many, many (and one more time - many!) signal transduction pathways that include thousands of proteins, 

lol, the one more time. True
properties (22)
post_id43,771,420
authorvanessahampton
permlinkre-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180415t223422371z
categorysteemstem
json_metadata"{"app": "busy/2.4.0", "community": "busy", "tags": ["steemstem"]}"
created2018-04-15 22:34:36
last_update2018-04-15 22:34:36
depth1
children0
net_rshares0
last_payout2018-04-22 22:34:36
cashout_time1969-12-31 23:59:59
total_payout_value0.000 SBD
curator_payout_value0.000 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length166
author_reputation3,362,533,862,951
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
@lordneroo ·
$0.08
Hey @scienceangel, thank you for your contributions! This is a wonderful blog.

> In each of our cells there are many, many (and one more time - many!) signal transduction pathways that include thousands of proteins, which are tightly regulated to keep our cells alive and functioning. In cancer cells, due to the activation of oncogenes, many of those pathways are deregulated to provide advantages to cancer cells over normal cells.

So does that basically mean that the more protein we consume the more dangerous it can be for our health? (Don't get mad at me if that's a dumb question :P)

Thank you so much for providing detailed information regarding the use of the GSEA software. Personally I'm an Engineer so I will never get to use this software, however I do appreciate you taking the time to create this tutorial as I know how vital this can be for young scientists in your field. (We all depend on software and tutorials :P)

Please keep up the great work!
๐Ÿ‘  
properties (23)
post_id43,772,915
authorlordneroo
permlinkre-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180415t225042862z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "users": ["scienceangel"], "tags": ["steemstem"]}"
created2018-04-15 22:50:45
last_update2018-04-15 22:50:45
depth1
children3
net_rshares15,630,148,820
last_payout2018-04-22 22:50:45
cashout_time1969-12-31 23:59:59
total_payout_value0.060 SBD
curator_payout_value0.018 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length968
author_reputation213,796,208,950,223
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (1)
@scienceangel ·
$0.05
Thank you very much!

Actually, it is a very good question.

It is quite normal that we ingest a lot of things on daily basis that are not necessary/in excess for our organism/cells at given moment - too much water, salt, carbs, fats, proteins, etc. Our cells of course have mechanisms of keeping homeostasis of intracellular environment, so when we ingest too much proteins for example, they will be digested in our gastrointestinal tract down to amino acids, which then hit the bloodstream. Cells will take up the amount of amino acids they need (especially muscle cells), and the rest/excess amino acids will be broken down in the liver to form ammonia. The liver converts the ammonia into urea, because ammonia is toxic. At last, urea is excreted from the body through the kidneys.
So when you ingest more proteins than your cells need, it will be just more work for your liver and kidneys :)
There are studies connecting red/processed meat intake with the increased risk of bowel cancer. This is however, not due to increased protein intake, but due to carcinogens found in processed meat, and if we are talking about red, non-processed meat, some evidence suggest that chemicals formed during digestion may damage the cells that line the bowel. Other causes may be the fat content, and the way it is processed or cooked; or just because people who eat preferentially red meat usually have low intake of "protective foods" such as fruit and vegetables or wholegrain cereals.
So take-home advice - make sure that you eat everything in moderate quantities (balanced diet), have plenty of physical activity, avoid smoking and drinking too much alcohol, and you'll be fine ;)
๐Ÿ‘  ,
properties (23)
post_id43,818,514
authorscienceangel
permlinkre-lordneroo-re-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180416t062142226z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-16 06:21:42
last_update2018-04-16 06:21:42
depth2
children2
net_rshares9,220,311,809
last_payout2018-04-23 06:21:42
cashout_time1969-12-31 23:59:59
total_payout_value0.039 SBD
curator_payout_value0.006 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length1,676
author_reputation113,646,366,638,572
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (2)
@lordneroo ·
Hey, thank you SO much for taking the time to leave such a detailed response, I truly appreciate that!

So even excessive water consumption can harm our health? I've been drinking TOO much water for years!! Should I worry?

Again, thank you so much for providing such deep information in this thread! Your blog was spectacular to say the least, but you also wrote another mini post in response to my comment!

Stay awesome!
properties (22)
post_id43,869,600
authorlordneroo
permlinkre-scienceangel-re-lordneroo-re-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180416t134259473z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-16 13:43:00
last_update2018-04-16 13:43:00
depth3
children1
net_rshares0
last_payout2018-04-23 13:43:00
cashout_time1969-12-31 23:59:59
total_payout_value0.000 SBD
curator_payout_value0.000 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length423
author_reputation213,796,208,950,223
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
@qiyi ·
$0.08
Interesting!  Looks like we're in the same field of biomedical science and cellular/molecular biology.  You have a new follower!
๐Ÿ‘  
properties (23)
post_id43,775,358
authorqiyi
permlinkre-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180415t232003764z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-15 23:20:06
last_update2018-04-15 23:20:06
depth1
children0
net_rshares15,372,509,004
last_payout2018-04-22 23:20:06
cashout_time1969-12-31 23:59:59
total_payout_value0.057 SBD
curator_payout_value0.018 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length128
author_reputation1,728,931,463,548
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (1)
@effofex ·
$0.25
Oh wow, this is prompting me to get my butt off Steemit during lunch and do some sideline research.  It seems like gene sets have a pretty clear analog in 'functional guilds' in microbio.  I wonder if anyone has translated the mathematics over into ecology.
๐Ÿ‘  ,
properties (23)
post_id43,899,162
authoreffofex
permlinkre-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180416t171839830z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-16 17:18:42
last_update2018-04-16 17:18:42
depth1
children4
net_rshares49,812,117,431
last_payout2018-04-23 17:18:42
cashout_time1969-12-31 23:59:59
total_payout_value0.243 SBD
curator_payout_value0.006 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length257
author_reputation14,417,464,439,057
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (2)
@alexs1320 ·
In ecology - they need us :D 

The most popular is an easy matrix algebra, "game of life" simulation
Or they measure various parameters and need some multivariate analysis to obtain the conclusion 

And the latest fashion and the closest thing to your question is:  [environmental DNA (eDNA)](https://www.sciencedirect.com/science/article/pii/S0006320714004443)
๐Ÿ‘  
properties (23)
post_id43,900,291
authoralexs1320
permlinkre-effofex-re-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180416t172723617z
categorysteemstem
json_metadata"{"links": ["https://www.sciencedirect.com/science/article/pii/S0006320714004443"], "app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-16 17:27:24
last_update2018-04-16 17:27:24
depth2
children3
net_rshares1,542,965,247
last_payout2018-04-23 17:27:24
cashout_time1969-12-31 23:59:59
total_payout_value0.000 SBD
curator_payout_value0.000 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length361
author_reputation149,050,464,338,619
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (1)
@effofex ·
$0.05
I hear what you're saying, but ecological modelling has progressed far beyond Conway's game of life.  
* LTI systems modelling migration behavior
* various ODE and PDE models of population dyanmics
* ecological network inference and modelling (including keystone species prediction and syntrophic pair identification)

etc
๐Ÿ‘  
properties (23)
post_id43,901,598
authoreffofex
permlinkre-alexs1320-re-effofex-re-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180416t173817528z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-16 17:38:21
last_update2018-04-16 17:38:21
depth3
children2
net_rshares11,099,898,525
last_payout2018-04-23 17:38:21
cashout_time1969-12-31 23:59:59
total_payout_value0.042 SBD
curator_payout_value0.012 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length322
author_reputation14,417,464,439,057
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (1)
@dailypick ·
This post has been upvoted and picked by [Daily Picked #31](https://steemit.com/curation/@dailypick/dailypick-31-16-april-or-explore-quality-content)! Thank you for the cool and quality content. Keep going! 

Donโ€™t forget Iโ€™m not a robot. I explore, read, upvote and share manually โ˜บ๏ธ
properties (22)
post_id43,918,636
authordailypick
permlinkre-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180416t195403216z
categorysteemstem
json_metadata"{"links": ["https://steemit.com/curation/@dailypick/dailypick-31-16-april-or-explore-quality-content"], "app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-16 19:54:03
last_update2018-04-16 19:54:03
depth1
children0
net_rshares0
last_payout2018-04-23 19:54:03
cashout_time1969-12-31 23:59:59
total_payout_value0.000 SBD
curator_payout_value0.000 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length284
author_reputation484,792,123,942
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
@steemitboard ·
Congratulations @scienceangel! You have completed some achievement on Steemit and have been rewarded with new badge(s) :

[![](https://steemitimages.com/70x80/http://steemitboard.com/notifications/voted.png)](http://steemitboard.com/@scienceangel) Award for the number of upvotes received

Click on any badge to view your own Board of Honor on SteemitBoard.
For more information about SteemitBoard, click [here](https://steemit.com/@steemitboard)

If you no longer want to receive notifications, reply to this comment with the word `STOP`

> Upvote this notification to help all Steemit users. Learn why [here](https://steemit.com/steemitboard/@steemitboard/http-i-cubeupload-com-7ciqeo-png)!
properties (22)
post_id43,922,183
authorsteemitboard
permlinksteemitboard-notify-scienceangel-20180416t202612000z
categorysteemstem
json_metadata"{"image": ["https://steemitboard.com/img/notifications.png"]}"
created2018-04-16 20:26:12
last_update2018-04-16 20:26:12
depth1
children0
net_rshares0
last_payout2018-04-23 20:26:12
cashout_time1969-12-31 23:59:59
total_payout_value0.000 SBD
curator_payout_value0.000 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length692
author_reputation38,705,954,145,809
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
@mountain.phil28 · (edited)
$0.15
Hell yah, that's one comprehensive and specific guideline. :) 
Unfortunatelly my recent biochem work does not need me to use it. -> But provides me with a lot of other problems ofc. Haha!
๐Ÿ‘  ,
properties (23)
post_id44,689,957
authormountain.phil28
permlinkre-scienceangel-lab-diaries-5-gene-set-enrichment-analysis-gsea-of-a-large-scale-biological-data-part-i-20180421t143408874z
categorysteemstem
json_metadata"{"app": "steemit/0.1", "tags": ["steemstem"]}"
created2018-04-21 14:34:09
last_update2018-04-21 14:34:24
depth1
children0
net_rshares24,373,132,376
last_payout2018-04-28 14:34:09
cashout_time1969-12-31 23:59:59
total_payout_value0.118 SBD
curator_payout_value0.034 SBD
pending_payout_value0.000 SBD
promoted0.000 SBD
body_length187
author_reputation7,451,131,400,254
root_title"Lab Diaries #5 - Gene Set Enrichment Analysis (GSEA) of a Large-Scale Biological Data, Part I"
beneficiaries[]
max_accepted_payout1,000,000.000 SBD
percent_steem_dollars10,000
author_curate_reward""
vote details (2)