


Normalization for two bulk RNA-Seq samples to enable reliable fold-change estimation between genes















2












$begingroup$


I have two bulk RNA-Seq samples, already TPM-normalized.

I would like to know what a reasonable normalization procedure would be to enable downstream log fold-change estimation.

The distributions of the two samples over the common set of genes look similar:

[Figure: TPM distribution]

However, the two samples have only been TPM-normalized. Is that enough to guarantee reliable fold-change estimation? Should I use another normalization procedure, e.g. quantile normalization, before comparison?
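For reference, quantile normalization of two samples can be sketched roughly as below. This is only a minimal illustration in plain Python, not a recommendation: the function name and the TPM values are made up, and ties are broken by input order rather than averaged.

```python
def quantile_normalize(sample_a, sample_b):
    """Force two equal-length samples onto one shared distribution.

    Each value is replaced by the mean of the two values that share its rank.
    Ties are broken by input order (a simplification).
    """
    if len(sample_a) != len(sample_b):
        raise ValueError("samples must cover the same genes")

    # Reference distribution: mean of the two samples at each rank.
    reference = [(a + b) / 2 for a, b in zip(sorted(sample_a), sorted(sample_b))]

    def map_to_reference(sample):
        # Gene indices ordered from lowest to highest expression.
        order = sorted(range(len(sample)), key=lambda i: sample[i])
        normalized = [0.0] * len(sample)
        for rank, gene_index in enumerate(order):
            normalized[gene_index] = reference[rank]
        return normalized

    return map_to_reference(sample_a), map_to_reference(sample_b)


# Hypothetical TPM values for four genes (same gene order in both samples).
sample0 = [5.0, 2.0, 3.0, 10.0]
sample1 = [4.0, 1.0, 8.0, 6.0]
norm0, norm1 = quantile_normalize(sample0, sample1)
```

After this step both samples contain exactly the same set of values, only assigned to different genes, so the method implicitly treats any global difference between the two distributions as technical.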



My objective is to define a signature using the genes that are up-regulated in Sample1 with respect to Sample0, and vice versa. I'm using log fold-changes, but I'm concerned that their values may be affected by each sample's distribution.

Do you also have suggestions for how to define up-regulated genes with these data?

[Figure: scatter]
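A minimal sketch of the kind of calculation described above, assuming a pseudocount of 1 TPM and a two-fold cutoff (log2 fold change of at least 1). Both values, the function names and the gene names are illustrative assumptions, not part of the data.

```python
import math


def log2_fold_change(tpm_ref, tpm_test, pseudocount=1.0):
    """log2((test + pseudocount) / (ref + pseudocount)) for one gene."""
    return math.log2((tpm_test + pseudocount) / (tpm_ref + pseudocount))


def up_regulated(tpm_ref, tpm_test, min_log2fc=1.0, pseudocount=1.0):
    """Sorted names of genes whose log2 fold change (test vs. ref) reaches min_log2fc.

    Only genes present in both samples are considered.
    """
    shared_genes = tpm_ref.keys() & tpm_test.keys()
    return sorted(
        gene
        for gene in shared_genes
        if log2_fold_change(tpm_ref[gene], tpm_test[gene], pseudocount) >= min_log2fc
    )


# Hypothetical TPM values.
sample0 = {"geneA": 3.0, "geneB": 15.0, "geneC": 0.0, "geneD": 7.0}
sample1 = {"geneA": 15.0, "geneB": 3.0, "geneC": 0.5, "geneD": 7.0}

up_in_sample1 = up_regulated(sample0, sample1)  # Sample1 relative to Sample0
up_in_sample0 = up_regulated(sample1, sample0)  # Sample0 relative to Sample1
```

The pseudocount keeps genes with zero or near-zero TPM (geneC here) from producing infinite or very large fold changes. With a single sample per condition, a cutoff like this ranks genes but cannot attach statistical significance to them.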



















$endgroup$
























      rna-seq normalization fold-change
















      asked 2 hours ago









      gc5






















          2 Answers
































































            2





            $begingroup$

            It's not a good idea to TPM-normalise before differential expression analysis, because the raw read counts are needed to estimate shot noise and statistical significance. DESeq2 performs its own read-count normalisation as part of its differential expression analysis.
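To illustrate the shot-noise point with a rough sketch: assuming pure Poisson counting noise (a simplification that ignores biological variability), the same two-fold change is far less certain at 10 vs. 20 reads than at 1000 vs. 2000 reads. The function name and the read counts below are made up for illustration.

```python
import math


def log2fc_with_shot_noise(count_ref, count_test):
    """Return (log2 fold change, approximate standard error) for two raw counts.

    Assumes Poisson sampling noise only, so Var(ln count) is roughly 1 / count
    (delta method). Both counts must be greater than zero.
    """
    log2fc = math.log2(count_test / count_ref)
    std_err = math.sqrt(1 / count_ref + 1 / count_test) / math.log(2)
    return log2fc, std_err


# Same two-fold change, very different sequencing depth for the gene.
low_fc, low_se = log2fc_with_shot_noise(10, 20)
high_fc, high_se = log2fc_with_shot_noise(1000, 2000)

print(f"10 vs 20 reads:     log2FC = {low_fc:.2f} +/- {low_se:.2f}")
print(f"1000 vs 2000 reads: log2FC = {high_fc:.2f} +/- {high_se:.2f}")
```

Once the data are converted to TPM, both genes can show the same ratio, and the information about how many reads support it is no longer available.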















            $endgroup$













            answered 1 hour ago









            gringer












            • $begingroup$
              I agree about TPM, for a lot of reasons; unfortunately, the data were already in TPM. Can you explain more about how read counts are useful for determining shot noise and statistical significance? Thanks
              $endgroup$
              – gc5
              28 mins ago





































                2





                $begingroup$

                What I have generally done in the past for bulk RNA-Seq is to process the data using voom from the limma package. voom can be combined with different normalization methods; "TMM" (computed with calcNormFactors from edgeR before calling voom) works fine for me and is advocated by many in the field.

                voom outputs an object containing the normalized expression values on a log2 scale (log2 counts per million), which, in my experience, has also worked out just fine for calculating log fold changes.
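As a rough illustration of what those log2 values are: voom computes log2 counts per million with small offsets, log2((count + 0.5) / (library size + 1) × 10^6), where the library size is first scaled by the normalization factor. The sketch below reproduces only that transformation in plain Python. It does not estimate TMM factors or voom's precision weights, the function name and counts are made up, and it needs raw counts rather than TPM values.

```python
import math


def log2_cpm(counts, norm_factor=1.0):
    """log2 counts per million for one sample, using voom's offsets.

    counts: raw read counts, one entry per gene.
    norm_factor: scaling factor applied to the library size
                 (e.g. a TMM factor); 1.0 means no extra scaling.
    """
    effective_lib_size = sum(counts) * norm_factor
    return [
        math.log2((count + 0.5) / (effective_lib_size + 1.0) * 1e6)
        for count in counts
    ]


# Hypothetical raw counts for four genes in two samples.
counts0 = [0, 15, 127, 999_857]
counts1 = [3, 63, 127, 999_806]

logcpm0 = log2_cpm(counts0)
logcpm1 = log2_cpm(counts1)

# On the log2 scale, a log fold change is simply a difference.
log2_fold_changes = [b - a for a, b in zip(logcpm0, logcpm1)]
```

The 0.5 offset keeps zero counts finite, which is why the first gene still gets a defined log fold change.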



                For more information on the package and on normalization methods, see: https://www.bioconductor.org/packages/devel/workflows/vignettes/RNAseq123/inst/doc/limmaWorkflow.html
                It is a very thorough introduction to the package and its capabilities.



                Good luck!















                $endgroup$













                answered 1 hour ago









                h3ab74





























