journal article Open Access Jun 30, 2020

Small-Angle Scattering and Multifractal Analysis of DNA Sequences

View at Publisher Save 10.3390/ijms21134651
Abstract
The arrangement of A, C, G and T nucleotides in large DNA sequences of many prokaryotic and eukaryotic cells exhibit long-range correlations with fractal properties. Chaos game representation (CGR) of such DNA sequences, followed by a multifractal analysis, is a useful way to analyze the corresponding scaling properties. This approach provides a powerful visualization method to characterize their spatial inhomogeneity, and allows discrimination between mono- and multifractal distributions. However, in some cases, two different arbitrary point distributions, may generate indistinguishable multifractal spectra. By using a new model based on multiplicative deterministic cascades, here it is shown that small-angle scattering (SAS) formalism can be used to address such issue, and to extract additional structural information. It is shown that the box-counting dimension given by multifractal spectra can be recovered from the scattering exponent of SAS intensity in the fractal region. This approach is illustrated for point distributions of CGR data corresponding to Escherichia coli, Phospholamban and Mouse mitochondrial DNA, and it is shown that for the latter two cases, SAS allows extraction of the fractal iteration number and the scaling factor corresponding to “ACGT” square, or to recover the number of bases. The results are compared with a model based on multiplicative deterministic cascades, and respectively with one which takes into account the existence of forbidden sequences in DNA. This allows a classification of the DNA sequences in terms of random and deterministic fractals structures emerging in CGR.
Topics

No keywords indexed for this article. Browse by subject →

References
54
[1]
Arneodo "Multi-scale coding of genomic information: From DNA sequence to genome structure and function" Phys. Rep. (2011) 10.1016/j.physrep.2010.10.001
[2]
Controlling the double helix

Gary Felsenfeld, Mark Groudine

Nature 2003 10.1038/nature01411
[3]
"Fractal genome sequences" Gene (2012) 10.1016/j.gene.2012.01.090
[4]
Albuquerque "DNA-based nanobiostructured devices: The role of quasiperiodicity and correlation effects" Phys. Rep. (2014) 10.1016/j.physrep.2013.10.004
[5]
Niu "Predicting DNA binding proteins using support vector machine with hybrid fractal features" J. Theor. Biol. (2014) 10.1016/j.jtbi.2013.10.009
[6]
Lennon "Lung cancer—A fractal viewpoint" Nat. Rev. Clin. Oncol. (2015) 10.1038/nrclinonc.2015.108
[7]
Babic, M., Mihelic, J., and Calì, M. (2020). Complex Network Characterization Using Graph Theory and Fractal Geometry: The Case Study of Lung Cancer DNA Sequences. Appl. Sci., 10. 10.3390/app10093037
[8]
Mutual information functions versus correlation functions

Wentian Li

Journal of Statistical Physics 1990 10.1007/bf01025996
[9]
Evolution of long-range fractal correlations and 1/fnoise in DNA base sequences

Richard F. Voss

Physical Review Letters 1992 10.1103/physrevlett.68.3805
[10]
Havlin "Statistical and linguistic features of DNA sequences" Fractals (1995) 10.1142/s0218348x95000229
[11]
Measuring correlations in symbol sequences

Hanspeter Herzel, Ivo Große

Physica A: Statistical Mechanics and its Applicati... 1995 10.1016/0378-4371(95)00104-f
[12]
Oliver "Compositional segmentation and long-range fractal correlations in DNA sequences" Phys. Rev. E (1996) 10.1103/physreve.53.5181
[13]
Arneodo "Wavelet based fractal analysis of DNA sequences" Phys. D Nonlinear Phenom. (1996) 10.1016/0167-2789(96)00029-2
[14]
Jeffrey "Chaos game representation of gene structure" Nucleic Acids Res. (1990) 10.1093/nar/18.8.2163
[15]
Hoang "Numerical encoding of DNA sequences by chaos game representation with application in similarity comparison" Genomics (2016) 10.1016/j.ygeno.2016.08.002
[16]
Rodriguez "Multifractal analysis of DNA sequences using a novel chaos-game representation" Phys. A Stat. Mech. Appl. (2001) 10.1016/s0378-4371(01)00333-8
[17]
Han "Wavelet-based multifractal analysis of DNA sequences by using chaos-game representation" Chin. Phys. B (2010) 10.1088/1674-1056/19/1/010205
[18]
Yu "Chaos game representation of protein sequences based on the detailed HP model and their multifractal and correlation analyses" J. Theor. Biol. (2004) 10.1016/j.jtbi.2003.09.009
[19]
Long "Chaos game representation of functional protein sequences, and simulation and multifractal analysis of induced measures" Chin. Phys. B (2010) 10.1088/1674-1056/19/6/068701
[20]
Pal "Multifractal detrended cross-correlation analysis of genome sequences using chaos-game representation" Phys. A Stat. Mech. Appl. (2016) 10.1016/j.physa.2016.03.074
[21]
Zaia, A., Maponi, P., Zannotti, M., and Casoli, T. (2020). Biocomplexity and Fractality in the Search of Biomarkers of Aging and Pathology: Mitochondrial DNA Profiling of Parkinson’s Disease. Int. J. Mol. Sci., 21. 10.3390/ijms21051758
[22]
Feigin, L.A., and Svergun, D.I. (1987). Structure Analysis by Small-Angle X-Ray and Neutron Scattering, Springer. 10.1007/978-1-4757-6624-0
[23]
Martin "Scattering from fractals" J. Appl. Cryst. (1987) 10.1107/s0021889887087107
[24]
Schmidt "Small-angle scattering studies of disordered, porous and fractal systems" J. Appl. Cryst. (1991) 10.1107/s0021889891003400
[25]
Cherny "Deterministic fractals: Extracting additional information from small-angle scattering data" Phys. Rev. E (2011) 10.1103/physreve.84.036203
[26]
Anitas "Structural characterization of chaos game fractals using small-angle scattering analysis" PLoS ONE (2017) 10.1371/journal.pone.0181385
[27]
Debye "Zerstreuung von Röntgenstrahlen" Ann. Phys. (1915) 10.1002/andp.19153510606
[28]
Provata "Fractal Cantor Patterns in the Sequence Structure of DNA" Fractals (2000) 10.1142/s0218348x00000044
[29]
Barnsley, M.F. (2000). Fractals Everywhere, Morgan Kaufmann. [2nd ed.].
[30]
Rogers, C.A. (1970). Hausdorff Measures, Cambridge University Press.
[31]
(1918). Dimension und äußeres Maß. Mathematische Annalen, 79, 157–179. 10.1007/bf01457179
[32]
Gouyet, J.F. (1996). Physics and Fractal Structures, Masson.
[33]
Arneodo "A wavelet-based method for multifractal image analysis. I. Methodology and test applications on isotropic and anisotropic random rough surfaces" Eur. Phys. J. B (2000) 10.1007/s100510051161
[34]
Decoster "A wavelet-based method for multifractal image analysis. II. Applications to synthetic multifractal rough surfaces" Eur. Phys. J. B (2000) 10.1007/s100510051179
[35]
Muzy "Multifractal formalism for fractal signals: The structure-function approach versus the wavelet-transform modulus-maxima method" Phys. Rev. E (1993) 10.1103/physreve.47.875
[36]
Chhabra "Direct Determination of the f (alpha) Singularity Spectrum" Phys. Rev. Lett. (1989) 10.1103/physrevlett.62.1327
[37]
Pantos "Simulation of small-angle scattering from large assemblies of multi-type scatterer particle" J. Mol. Struct. (1996) 10.1016/s0022-2860(96)09302-7
[38]
Meakin "Diffusion-limited aggregation on multifractal lattices: A model for fluid-fluid displacement in porous media" Phys. Rev. A (1987) 10.1103/physreva.36.2833
[39]
Martinez, V.J., Jones, B.J.T., Dominguez-Tenreiro, R., and van de Weygaert, R. (1990). Clustering Paradigms and Multifractal Measures. Astrophys. J., 357. 10.1086/168890
[40]
Tarquis "Multifractal analysis of tori destruction in a molecular Hamiltonian system" Phys. Rev. E (2001) 10.1103/physreve.65.016213
[41]
Hao "Avoided Strings in Bacterial Complete Genomes and a Related Combinatorial Problem. Annals of Combinatorics" Ann. Comb. (2000) 10.1007/pl00001279
[42]
Hao "Fractals related to long DNA sequences and complete genomes" Chaos Solitons Fract. (2000) 10.1016/s0960-0779(98)00182-9
[43]
Hao "Factorizable language: from dynamics to bacterial complete genomes" Physica A (2000) 10.1016/s0378-4371(00)00411-8
[44]
Yang "DNA Sequences with Forbidden Words and the Generalized Cantor Set" J. Appl. Math. Phys. (2019) 10.4236/jamp.2019.78115
[45]
Cherny "Scattering from generalized Cantor fractals" J. Appl. Cryst. (2010) 10.1107/s0021889810014184
[46]
Anitas, E.M. (2020). Small-Angle Scattering from Fractals: Differentiating between Various Types of Structures. Symmetry, 12. 10.3390/sym12010065
[47]
NCBI (2020, June 29). PLN Phospholamban [Homo Sapiens (Human)], Available online: https://www.ncbi.nlm.nih.gov/gene/5350.
[48]
Bibb "Sequence and gene organization of mouse mitochondrial DNA" Cell (1981) 10.1016/0092-8674(81)90300-7
[49]
Brooks "Non-O157 Shiga toxin-producing Escherichia coli infections in the United States" J. Infect. Dis. (2005) 10.1086/466536
[50]
Nakamura "Differential dynamics and impacts of prophages and plasmids on the pangenome and virulence factor repertoires of Shiga toxin-producing Escherichia coli O145:H28" Microb. Genom. (2020)

Showing 50 of 54 references

Metrics
11
Citations
54
References
Details
Published
Jun 30, 2020
Vol/Issue
21(13)
Pages
4651
License
View
Cite This Article
Eugen Mircea Anitas (2020). Small-Angle Scattering and Multifractal Analysis of DNA Sequences. International Journal of Molecular Sciences, 21(13), 4651. https://doi.org/10.3390/ijms21134651
Related

You May Also Like