什么是数据界(What is Datanature?)

ABSTRACT

Data are everything in Cyberspace.  Data are the reflections of nature and human behaviors, Datanature (all  data in Cyberspace) is forming unconsciously and developing. Meanwhile,  more data without references in nature, like computer viruses, some  network games and junk data, are generated in Datanature. Datanature  would gradually cover and exceed the facts in nature and have its own  patterns, not just overlaps with nature and reflects the laws in nature.  Nowadays, people always derive majority knowledge from Datanature and  only minority by their own experience in nature. There would be new  specific phenomena and patterns hidden in Datanature which differ from  those of the nature or human behaviors. For example, the history of data  (what is the first data), the evolution of data from structured data to  unstructured data, or the development of network game, or the formation  of data networks, data tribes and data countries.

At the beginning our culture and  society are built on nature, and then computer science and technology  help people store both culture and society and nature into computer  systems when the computer was invented. Culture and society have been  being built on both nature and Datanature. Furthermore, our culture and  society will rely increasingly on Datanature. In Datanature, we will  face many new problems. For example, if one asks how many papers are  related to DNA in Science, it is easy to answer; if he asks how many  fields are studied in science, it is difficult to answer; if he asks  which paper is associated with each other in Science or what is  indicated by all the papers in Science, it is more difficult to answer.  Therefore, we say that Science itself is a valuable research topic.  Science has been digitized and stored into computer systems in the form  of data. It forms a data tribe or region like human tribe or region  in nature.

Various datasets (data tribe or  region) in Datanature are organized as a type of Datanetwork. A  typically example is that online social networks and communities are  formed by human behaviors in Datanature. Small World Theory in nature  has been testing and verifying in Datanature. Therefore, the study on  Datanetwork is one of issues in Datanature research. That is,  traditional methods and theories developed in nature are being applied,  transferred and extended in Datanature including classical complex  network theory and graph mining algorithms.

Datanature is exceeding the facts  in nature. Consequently, it is necessary to research on data in  cyberspace. Data Science includes two key connotations. One is to  provide a kind of novel research methods, called Scientific Research  Method with Data, for nature science and social science, also referred  to as Data-intensive Research Method; The other is to Research JUST on  Data, i.e., study the phenomena and laws of Datanature including the  history of data (what is the first data), the formation of Datanetworks,  data tribes and data countries (like Google), the transformation and  evolution of data, type of data, properties of data, etc.

Keywords: Data Science, Datanature, Datanetwork

(引用标注: Yangyong Zhu, Yun Xiong. What is Datanature? [TR][OL]. 2009. available at: http://www.dataology.fudan.edu.cn/)




友情链接
联系我们
地址: 中国 上海市浦东新区张衡路825号复旦大学张江校区计算机楼
邮编: 201203
电话: +86-21-51355518 / +86-21-51355100
传真: +86-21-51355100
E-mail: dataology@fudan.edu.cn