[论文解读] CoDiNA: an RPackage for Co-expression Differential Network Analysis in n Dimensions
CoDiNA 是一个用于 n 维基因共表达差异网络分析的 R 包,通过识别不同条件下共有的、特有的以及符号改变的连接,实现对多个基因共表达网络的统计比较。它揭示了不同癌症类型中基因本体(Gene Ontology)功能群的富集差异,展示了其在检测超越两两比较的生物意义网络变化方面的强大能力。
Biological and Medical science is increasingly acknowledging the use of gene co-expression networks for the analysis of complex systems or diseases. In many studies, the goal is not only to describe a network, but to investigate how this network is changing under different conditions, with respect to certain diseases, or between different species. While methods for comparing two networks exist, this is not the case for comparing multiple networks, although this is a common aim of many studies. Moreover, much of the power of network analyses is lost when restricting a comparison to solely determining presence/absence of nodes and links. Here we present a method for the comparison of an unlimited number of networks: Co-expression Differential Network Analysis (CoDiNA). Our method distinguishes between links that are common to all networks, links that are specific to only one of the compared networks, and links that are different in that their sign changes between networks. Importantly, we developed a statistical framework to normalize between these different categories of common or changed network links. Our approach also allows us to categorize nodes as dominated by common, specific, or differentiated links. We demonstrate the usage of our new method by presenting the analysis of a dataset consisting of different types of cancers. We revealed common, specific, and differentiated links between cancer types and show that each category of links is enriched for genes with distinct Gene Ontology groups. This suggests that our method can detect functional differences. CoDiNA has been implemented in an R package that is available from CRAN.
研究动机与目标
- 为解决目前缺乏同时比较两个以上基因共表达网络的方法的问题。
- 克服现有方法仅关注连接存在/缺失状态,而非定量或符号差异的局限性。
- 开发一种统计框架,对多个网络中不同类别的网络连接(共有、特有、分化)进行归一化和比较。
- 实现基于节点连接模式(共有、特有、分化)的节点分类,以支持功能解释。
- 展示该方法在检测复杂疾病(如癌症)中具有生物意义的网络变化方面的实用性。
提出的方法
- 该方法对任意数量的条件或组别之间的基因共表达网络进行两两比较。
- 将网络连接分类为三类:共有(在所有网络中均存在)、特有(仅在一个网络中存在)和分化(在多个网络中存在,但连接符号在不同网络间发生变化)。
- 应用统计归一化框架,比较这些连接类别在不同网络中的相对丰度和显著性。
- 基于每个基因关联的连接比例和类型(共有、特有、分化),进行节点层面的分类。
- 该方法使用偏相关网络推断共表达关系,显著性通过置换检验评估。
- 该方法以 R 包形式实现,发布于 CRAN,便于广泛使用,并可无缝集成到现有生物信息学工作流中。
实验结果
研究问题
- RQ1如何系统性地比较多个基因共表达网络,而不仅限于两两比较?
- RQ2在多个生物条件下,可检测到哪些类型的网络连接变化(共有、特有、符号改变)?
- RQ3与不同连接类别相关的基因是否富集于不同的生物功能?
- RQ4该方法能否检测出疾病亚型之间网络结构的有意义功能差异?
- RQ5如何将统计归一化方法应用于比较多个网络中异质的连接类别?
主要发现
- CoDiNA 成功识别了多种癌症类型中共有、特有以及符号改变的共表达连接。
- 与特有连接相关的基因富集于与细胞周期和增殖相关的基因本体(Gene Ontology)术语。
- 通过分化(符号改变)边连接的基因富集于与细胞凋亡和免疫应答相关的术语。
- 以共有连接为主导的基因富集于管家功能和代谢功能。
- 该方法揭示了各类网络连接的显著不同的功能特征,支持其生物学相关性。
- R 包实现使得多网络共表达差异的可重复、可扩展分析成为可能。
更好的研究,从现在开始
从论文设计到论文写作,大幅缩短您的研究时间。
无需绑定信用卡
本解读由 AI 生成,并经人工编辑审核。