Principal Component Analysis and Randomness Test for Big Data Analysis

Principal Component Analysis and Randomness Test for Big Data Analysis
Author :
Publisher : Springer Nature
Total Pages : 153
Release :
ISBN-10 : 9789811939679
ISBN-13 : 9811939675
Rating : 4/5 (79 Downloads)

Book Synopsis Principal Component Analysis and Randomness Test for Big Data Analysis by : Mieko Tanaka-Yamawaki

Download or read book Principal Component Analysis and Randomness Test for Big Data Analysis written by Mieko Tanaka-Yamawaki and published by Springer Nature. This book was released on 2023-05-23 with total page 153 pages. Available in PDF, EPUB and Kindle. Book excerpt: This book presents the novel approach of analyzing large-sized rectangular-shaped numerical data (so-called big data). The essence of this approach is to grasp the "meaning" of the data instantly, without getting into the details of individual data. Unlike conventional approaches of principal component analysis, randomness tests, and visualization methods, the authors' approach has the benefits of universality and simplicity of data analysis, regardless of data types, structures, or specific field of science. First, mathematical preparation is described. The RMT-PCA and the RMT-test utilize the cross-correlation matrix of time series, C = XXT, where X represents a rectangular matrix of N rows and L columns and XT represents the transverse matrix of X. Because C is symmetric, namely, C = CT, it can be converted to a diagonal matrix of eigenvalues by a similarity transformation SCS-1 = SCST using an orthogonal matrix S. When N is significantly large, the histogram of the eigenvalue distribution can be compared to the theoretical formula derived in the context of the random matrix theory (RMT, in abbreviation). Then the RMT-PCA applied to high-frequency stock prices in Japanese and American markets is dealt with. This approach proves its effectiveness in extracting "trendy" business sectors of the financial market over the prescribed time scale. In this case, X consists of N stock- prices of length L, and the correlation matrix C is an N by N square matrix, whose element at the i-th row and j-th column is the inner product of the price time series of the length L of the i-th stock and the j-th stock of the equal length L. Next, the RMT-test is applied to measure randomness of various random number generators, including algorithmically generated random numbers and physically generated random numbers. The book concludes by demonstrating two applications of the RMT-test: (1) a comparison of hash functions, and (2) stock prediction by means of randomness, including a new index of off-randomness related to market decline.


Principal Component Analysis and Randomness Test for Big Data Analysis Related Books

Principal Component Analysis and Randomness Test for Big Data Analysis
Language: en
Pages: 153
Authors: Mieko Tanaka-Yamawaki
Categories: Business & Economics
Type: BOOK - Published: 2023-05-23 - Publisher: Springer Nature

DOWNLOAD EBOOK

This book presents the novel approach of analyzing large-sized rectangular-shaped numerical data (so-called big data). The essence of this approach is to grasp
Places Rated Almanac
Language: en
Pages: 452
Authors: David Savageau
Categories: Philosophy
Type: BOOK - Published: 1993 - Publisher: Prentice Hall

DOWNLOAD EBOOK

This sometimes controversial bestseller, completely updated with all new statistics, is packed with timely facts and unbiased information on more than 300 metro
Principal Component Analysis
Language: en
Pages: 283
Authors: I.T. Jolliffe
Categories: Mathematics
Type: BOOK - Published: 2013-03-09 - Publisher: Springer Science & Business Media

DOWNLOAD EBOOK

Principal component analysis is probably the oldest and best known of the It was first introduced by Pearson (1901), techniques ofmultivariate analysis. and dev
Python Data Science Handbook
Language: en
Pages: 609
Authors: Jake VanderPlas
Categories: Computers
Type: BOOK - Published: 2016-11-21 - Publisher: "O'Reilly Media, Inc."

DOWNLOAD EBOOK

For many researchers, Python is a first-class tool mainly because of its libraries for storing, manipulating, and gaining insight from data. Several resources e
Handbook of Research on Big Data Storage and Visualization Techniques
Language: en
Pages: 1078
Authors: Segall, Richard S.
Categories: Computers
Type: BOOK - Published: 2018-01-05 - Publisher: IGI Global

DOWNLOAD EBOOK

The digital age has presented an exponential growth in the amount of data available to individuals looking to draw conclusions based on given or collected infor