Data science

honggarae 06/01/2023 441

Definition

Data science is an interdisciplinary field that combines the scientific methods, systems, and processes of statistics, information science, and computer science to gain insight into phenomena from structured or unstructured data.

Development History

The development of data science runs from the first general-purpose computer in 1946, through experimental simulations in scientific research that produced large amounts of data and used algorithms to discover patterns in it, to Jack Ma (Ma Yun) announcing the shift from the "IT age" to the "DT age".

In 1974, the renowned computer scientist and Turing Award winner Peter Naur explicitly proposed the concept of data science in the preface to his book "Concise Survey of Computer Methods". He described data science as "a science based on data" and distinguished between data science and datalogy: the former is the science of dealing with data, while the latter focuses on data processing and its place in education ("the science of data and of data processes and its place in education").

After Peter Naur explicitly proposed the concept, data science research went through a long period of silence. It was not until 2001 that William S. Cleveland, then working at Bell Laboratories, published the paper "Data Science: An Action Plan for Expanding the Technical Areas of the Field of Statistics" in the academic journal International Statistical Review, arguing that data science is an important research direction for statistics. Data science thus regained the attention of statisticians. In 2013, Mattmann CA and Dhar V published the papers "Computing: A Vision for Data Science" in Nature and "Data Science and Prediction" in Communications of the ACM, respectively. They discussed the meaning of data science from the perspective of computer science and technology, which brought data science into the scope of that field.

However, data science drew wide public attention because of three landmark events. First, Patil DJ and Davenport TH published "Data Scientist: The Sexiest Job of the 21st Century" in Harvard Business Review. Second, in 2012 big data thinking was applied to a US presidential election for the first time, helping Obama defeat Romney and win re-election. Third, in 2015 the White House created the position of chief data scientist and hired Patil DJ as the first person to hold it.

According to Gartner's 2014 Hype Cycle for Emerging Technologies, by July 2014 data science was close to the end of the innovation and inflated-expectations phases and was expected to reach the plateau of productivity within 2 to 5 years. Another Gartner study describes the hype cycle of data science itself, as shown in Figure 1. Figure 1 shows that the components of data science differ in maturity:

R is the most mature and is already widely used in production.
Simulation, ensemble learning, video and image analytics, text analytics, and similar techniques are approaching maturity and will soon be put into practical use.
Hadoop-based data discovery may become obsolete.
Speech analytics, model management, natural-language question answering, and similar techniques have passed the hype period and are moving toward practical use.
Citizen data science, model factories, the algorithm marketplace (economy), prescriptive analytics, and similar areas are developing rapidly.
