Mahmoud Ahmed

Postdoc - Cancer Genomics

Validating a re-implementation of an algorithm to integrate transcriptome and ChIP-seq data


Journal article


Mahmoud Ahmed, Deok Ryong Kim
PeerJ, vol. 11, 2023 Oct, pp. e16318

Cite

APA
Ahmed, M., & Kim, D. R. (2023). Validating a re-implementation of an algorithm to integrate transcriptome and ChIP-seq data. PeerJ, 11, e16318.


Chicago/Turabian
Ahmed, Mahmoud, and Deok Ryong Kim. “Validating a Re-Implementation of an Algorithm to Integrate Transcriptome and ChIP-Seq Data.” PeerJ 11 (October 2023): e16318.


MLA
Ahmed, Mahmoud, and Deok Ryong Kim. “Validating a Re-Implementation of an Algorithm to Integrate Transcriptome and ChIP-Seq Data.” PeerJ, vol. 11, Oct. 2023, p. e16318.


BibTeX

@article{ahmed2023a,
  title = {Validating a re-implementation of an algorithm to integrate transcriptome and ChIP-seq data},
  year = {2023},
  month = oct,
  journal = {PeerJ},
  pages = {e16318},
  volume = {11},
  author = {Ahmed, Mahmoud and Kim, Deok Ryong},
  month_numeric = {10}
}

Abstract

Transcription factor binding to a gene's regulatory region induces or represses the gene's expression. Binding and expression target analysis (BETA) integrates binding and gene expression data to predict this function. First, the regulatory potential of the factor is modeled with a decay function of the distance between its binding sites and the transcription start sites. Then, the differential expression statistics from an experiment in which the factor was perturbed represent the effect of binding. The rank product of the two values is used to order the targets by importance. This algorithm was originally implemented in Python. We reimplemented it in R to take advantage of existing data structures and other tools for downstream analyses. Here, we attempted to replicate the findings of the original BETA paper. We applied the new implementation to the same datasets using default and varying inputs and cutoffs, and we successfully replicated the original results. Moreover, we showed that the method responded appropriately to changes in the input and was robust to the choice of cutoffs in statistical testing.
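
The two steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' R implementation or the original BETA code. The decay constants (0.5 and 4), the 100 kb distance scale, the scaling of ranks by the number of genes, the function names, and the toy data are all assumptions made for illustration; the abstract specifies only a distance-based decay function and a rank product.

```python
import math


def regulatory_potential(distances_bp, scale_bp=100_000):
    """Sum an exponentially decaying weight over peak-to-TSS distances.

    The constants 0.5 and 4 and the 100 kb scale are assumptions,
    not values stated in the abstract.
    """
    return sum(math.exp(-(0.5 + 4 * abs(d) / scale_bp)) for d in distances_bp)


def descending_ranks(scores):
    """Give rank 1 to the largest score; ties keep their input order."""
    order = sorted(scores, key=lambda gene: scores[gene], reverse=True)
    return {gene: i + 1 for i, gene in enumerate(order)}


def rank_product(potential, expression_stat):
    """Multiply the scaled binding rank by the scaled expression rank.

    A smaller product marks a gene with both strong binding potential
    and a strong differential expression statistic.
    """
    genes = sorted(set(potential) & set(expression_stat))
    n = len(genes)
    binding_rank = descending_ranks({g: potential[g] for g in genes})
    expression_rank = descending_ranks({g: abs(expression_stat[g]) for g in genes})
    return {g: (binding_rank[g] / n) * (expression_rank[g] / n) for g in genes}


# Hypothetical toy data: peak distances to each gene's TSS (in bp) and
# a signed differential expression statistic per gene.
peaks = {"geneA": [1_000, 20_000], "geneB": [90_000], "geneC": [5_000]}
stats = {"geneA": 4.2, "geneB": -0.3, "geneC": -2.5}

potential = {gene: regulatory_potential(d) for gene, d in peaks.items()}
scores = rank_product(potential, stats)
ranked = sorted(scores, key=scores.get)  # most likely targets first
print(ranked)
```

In this toy example, geneA has two nearby peaks and the largest expression change, so it receives the smallest rank product and is listed first.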