Scenarios for geographic information use in the year 2010 suggest great potential to extend the capabilities of scientific researchers, decision-makers, and the public. This potential, however, will only be realized if there are substantial advances in Geographic Information Science, enhancing knowledge of geographic concepts and their computational implementations. To assess the needs for basic research in this emerging science and technology field, a workshop was held at the National Science Foundation January 14-15, 1999. Workshop participants represented a broad range of the disciplines involved in Geographic Information Science and technology. The workshop identified two important research streams: research in basic Geographic Information Science (hereafter, GIScience), and research using geographic information systems (hereafter, GIS). It is imperative that research in these two areas be integrated, as applications motivate the science, and awareness of theory improves applications. Basic research in GIScience has several compelling components. First is software integration, a general problem that needs specific research to solve its geospatial dimensions. Second, scale and resolution are spatial problems that interact with the scales (characteristic lengths) of environmental and social processes and with data quality. Third, process models are a general computing problem, but again geographic applications will require uniquely geographic solutions. And fourth, usability of systems and technologies is also a major component in need of research. In addition, uncertainty and spatial dependence were recognized as important crosscutting research themes. Geographic Information Science is clearly a coherent research field of strategic importance.
Workshop participants agreed that there is an urgent need for a focused investment in Geographic Information Science research, and that the National Science Foundation is the most appropriate US agency to do this. Such an investment is consistent with several important national trends, represented by the President's Information Technology Advisory Committee (PITAC) report, the Administrations FY 2000 Information Technology for the Twenty-First Century (IT2) initiative, and the National Spatial Data Infrastructure. The workshop found that there is a coherent research community poised to make advances in Geographic Information Science if sufficient research support is made available.
The workshop participants made the following recommendations to the National Science Foundation:
Dramatic developments in communication, information and computational technologies alone promise to revolutionize our lives even further. Advances in these fields will change the way science is performed and expand its capabilities dramatically. They will influence the ways we teach and learnperhaps even the way we think. Our scientific adventures are far from over.
Geographic information systems (GIS) and spatial analysis methods are powerful tools for the analysis and synthesis of geographically distributed phenomena, and form a critical component of the information infrastructure for science. Such systems are applicable to a wide variety of problems, including many areas of basic and applied research. Geographic Information Science is an inherently multidisciplinary field that underpins GIS. The GIS software, data, and services industry is estimated at $4.2 billion in the United States alone, and appears to be growing at around 20 percent per year. Geographic Information Science (GIScience) research, and research using Geographic information systems, have been funded by the National Science Foundation through a wide range of programs and other activities, including every Directorate to some degree. However, except for a special solicitation issued in 1987 to establish the National Center for Geographic Information Analysis, no program or special activity has focused explicitly and directly on GIScience and GIS.
In order to explore this situation, a workshop was held at the National Science Foundation on January 14-15, 1999. The goals of the workshop were to explore the relationship of GIScience to existing programs and initiatives at NSF, and to examine the prospects for new initiatives or other activities in the area of Geographic Information Science and Geospatial information. Twenty researchers from outside the Foundation, representing many of the disciplines and fields active in and dependent on GIScience, participated in the workshop, along with many members of the NSF staff . The participants are listed in the Appendix to this report.
In the remainder of this report, we first distinguish two distinct but deeply interconnected research areas: basic research in Geographic Information Science; and research using geographic information systems. We present arguments regarding why research in both areas is critical to the advancement of our knowledge of geographic information. They should both receive special funding emphasis at NSF, and the funding should be awarded to catalyze multidisciplinary research that integrates two research domains where possible. After reviewing four major computing problems that have a role in GIScience research, we outline several trends and opportunities in science, technology, and policy that would make an immediate NSF response very timely. After providing a summary of the community that would be likely to respond to an NSF activity in Geographic Information Science and Geospatial Activities, we close with specific recommendations to the Foundation.
|
A Geologic Perspective |
|
Roy Dokka |
|
Geology will derive substantial benefit from fundamental research in GI Science. It is becoming increasingly clear that achieving a more complete understanding of Earth's geologic systems by both academics and practitioners will require ever increasing degrees of integration of data, insights and the perspectives derived from many subdisciplines. GI Science research is focussed on several key elements that will allow for such integration. Although GI Science and its forerunners have provided a substantial intellectual framework and the practical means to organize and visualize large data sets, widespread implementation has yet to be realized. |
There is wide variability in the levels of adoption of GIS, spatial analysis, and related tools and methods across the environmental and social sciences. Scientific advances often are driven by the availability of both analytical tools for analysis and data required by those tools, and some sciences are missing out on the insights that could be provided through spatially-explicit problem solving enabled by GIS. For example, several impediments to greater use of GIS by geologists have been identified, and are typical of many other fields as well:
Geographic information systems are similar to many statistical packages, in that they are commercial software systems widely used both within and outside the research community. However, there is an important difference between the two areas: statistics has a more universally agreed-upon foundation, whereas there is not yet an equivalent mature foundation for GIS software. Geographic Information Science seeks to provide the theoretical foundation for GIS, just as the discipline of statistics provides foundations for statistical software. Again as in the case of statistics, basic research in GIScience is a legitimate (though young) scholarly enterprise in its own right. But the positive effects of GIScience on the GIS software industry, and on basic and applied research using GIS, are inescapable. And GISs are used in some of our most pressing societal problems, such as crime, health, and disaster response.
Basic research in GIScience addresses complex problems that require multidisciplinary solutions. Fundamental problems in such a field are at risk of falling through the cracks between traditional disciplines, especially in the absence of targeted funding to support it. GIScience may begin with deep ontological questions regarding the nature of space and phenomena in space. Is the concept of space itself different among different fields of study? If there are differences, what are the common elements? Results from studies of spatial cognition and spatial language have rarely been used to build spatial query languages, and work in robotics on objects moving in space over time cannot easily be integrated with work on spatio-temporal databases for moving objects. Even methods of spatial analysis developed in geography and regional science are often difficult to integrate into a GIS framework.
|
Integration |
|
Gio Wiederhold |
|
An important role for geographic information is its capability to link diverse information resources produced by other disciplines. For instance, records of ground-based events are commonly produced using place names, while aerial images are distributed with latitude and longitude keys. Geographic information provides the articulation points, but has to be very carefully and consistently organized in order to be able to serve that role. There is often urgency when data have to be integrated, so that principled representations are essential. Examples occur in disaster response planning, when requirements from the field have to be related with resources and available routes to get there. Automation and completeness in matching can greatly hasten resource delivery and reduce wasted efforts. |
Market forces that promote software integration and interoperability in business have not had as much effect on scientific software, which often has been developed with the narrow needs of a specific research community in mind. Existing tool systems do not always make it easy to respond to new technologies for data collection or processing, and the problem is complicated by uneven levels of technical abilities and training across the disciplines. GIS can serve as a frame for scientific data integration, but there are conceptual impediments to the integration of some scientific models with software. Data fusion is an integration problem, and conflation, the process of combining spatial data from different sources, is also critical. When geospatial data from different sources are combined, it is a challenge to preserve the semantics inherent in the component data sets, unless each was prepared strictly according to a common standard. Data quality for the results of data fusion or conflation may be difficult to characterize, especially with regard to positional accuracy.
Even without formal training in cartography, most people realize that the scale of a paper map influences the amount of detail that can be portrayed. But they may not realize how pervasive the influences of scale and resolution are on the analysis and other use of geospatial information in computers. Some of this is a legacy effect, since much geospatial information today was derived from maps. And for remotely sensed imagery, spatial resolution is a characteristic of the design of sensing instruments. Different measurement and positioning technologies will usually produce data with different positional accuracy, capable of resolving different levels of detail in geographic phenomena. A great deal of other geographic information is available not for points but for zones. For example, census data, a cornerstone of much social science research, are spatially aggregated in order to protect the confidentiality of individual records. The aggregation rules are based on a minimum population, and thus are larger in low-density areas and smaller where population density is high. The zones may also change from one census to the next. Simulations have shown that correlations between variables may vary considerably under different aggregation scenarios, calling into some question findings that are based only on analysis of data for the zones that happen to have been selected by the census bureau. Such scale or aggregation effects are not restricted to the social sciences but have been addressed in ecology as well. If basic Geographic Information Science can produce new methods of analysis that can minimize effects of scale and resolution of new and legacy geospatial data, this would have significant benefits for fields conducting research that uses geospatial information.
There is much knowledge embedded in the processes carried out to solve problems. Exploitation of geographical information not only requires having the right maps, but also requires knowing how to use them. The science of describing process knowledge is much less advanced than the science of describing data. For instance, even a simple task such as a search for the nearest restaurant cannot now be expressed effectively in any of the tools available on the Web. For spatial searches, these algorithms are closely linked to geographic representations, and there is a recognized interplay between process models and information representation that is poorly understood. Research, experiments, and development are required to make the wealth of the data that is becoming available fit the tools of the researchers and the public. Some of the key research topics here are spatial dynamic modeling, non-linear transformations, linkage and integration of process models with information systems, computability, and validation. The computability dimension may include heuristics to determine which solution method to use, based on the size and difficulty of the problem.
Usability often refers to issues of human-computer interaction (HCI), user interface design, and training. Indeed, these issues are of central importance, since if the user does not understand the system and its user interface, the system will at best be used inefficiently, and at worst will be used incorrectly and produce invalid results. At the workshop, though, participants put usability in a much broader context. Access is an important factor in the ability of a particular person to use GIS: access to data, access to processing power, access to technology, access to trainingall of these influence system use. More broadly yet, are people aware that systems and data exist at all? If so, how can they find out what data are available, whether they are fit for use? Issues of how users communicate their needs to the system get us back into HCI. Typing and mouse clicks are not the only means of interaction, but the potential of other interaction methods such as sketching, touch screens, and voice have hardly been examined for the geospatial context. The usability issue should also examine more deeply the potential value of collaborative decision-making across distances for spatial decision support. Usability also has a societal context: a single system, based on the same data, should be able to adapt to serve the needs of different sub-populations with different backgrounds and needs.
A different way to motivate basic research is through grand research challenges. The workshop program did not allow time for consensus building regarding such challenges, but in the opening session of the workshop, one of the participants presented four grand challenges for Geographic Information Science. Although they reflect the particular priorities and curiosities of one workshop participant, they are representative of the fundamental scientific questions that will drive GIScience in the next decade.
The central idea here is the challenge of representing the infinite complexity of the real world within the digital computer. The real world is usually thought of as a spatio-temporal continuum, whereas the digital computer has finite capacity, and represents concepts and values in a discrete code. To meet this challenge, GI Science must examine the geographic concepts that are used by environmental and social scientists in their research, including the ontology of reality at geographic scales. GI Science research in this area will be conducted by experts in geographic theory and geographic representations, by domain scientists who study geographically distributed environmental and social phenomena, by knowledge engineers and information scientists, and by philosophers.
The challenge: To find ways to express the infinite complexity of the geographical world in the binary alphabet and limited capacity of a digital computer.
|
Spatial Dependence |
|
Michael Stein |
|
Spatial dependence is central to the analysis and interpretation of all geographic data. Spatial dependence is a feature of both measurement and physical process. Thus, it is not adequate to just attach a standard error to each measurement. One also needs methods for accounting for the spatial dependencies. This is a challenge first of developing appropriate spatial and spatio-temporal models, then developing suitable statistical methodologies and finally, only then, incorporating such models and methodologies into widely available software. |
If the representational challenge cannot fully be met, we must accept that geospatial data include uncertainty. This uncertainty can include measurement error, error due to imperfect interpolation between measurements, gaps (incomplete data), artifacts of graphic or digital processing, and occasional blunders. Or it may be due to the nature of the phenomena themselves, such as the extents of objects with indistinct or graded boundaries. The scientific measurement model is available for some aspects of uncertainty, but strong spatial dependencies complicate the situation considerably. And some spatial processes are essentially stochastic, and thus have an inherent uncertainty component.
The challenge: To find ways of summarizing, modeling, and visualizing the differences between a digital representation and real phenomena.
Although some information comes directly from sensor into spatial databases, much geospatial information has been developed by human operators who use human judgment in their work. Data from maps have been through processes of symbolization, abstraction, and generalization. Thus inclusion of map-based information in GIS means that cognitive concepts are already incorporated into spatial databases. Many spatial or geographic abilities are part of common knowledge or common sense and are characteristics of most people by the time they reach the age of 12. Use of GIS and spatial analysis tools may depend on these abilities, but also on other concepts not learned until graduate school. In some ways, this is similar to the challenge of representation, except here it is the correspondence between binary representations, computational methods, and cognitive concepts that is the focus.
The challenge: To achieve better transitions between cognitive and computational representations and manipulations of geographic information.
It could be argued that one cannot claim to have understood a process if one cannot build a computer simulation of that process whose output cannot be distinguished from data about real-world instances of the process. Part of the popularity and impact of fractal mathematics has been the degree to which graphic renderings of fractal functions simulate, to some degree, landscapes or other natural phenomena. In detail, however, fractal mathematics has little in common with geomorphic processes, and an expert could quickly distinguish fractal terrains from real ones. Successful simulation of geographic phenomena would not only provide a way to confirm process geographic processes, but it also can provide generic data for testing algorithm performance, graphical procedures, and other GIS methods.
The challenge: To create simulations of geographic phenomena in a digital computer that are indistinguishable from their real counterparts.
One additional technological trend that requires a response is the increasing quantity of data being collected and archived. The fact that very large volumes of scientific data were becoming available was already evident in the late 1980s, but a dozen years later, data volumes are drastically higher again. Commercial remotely sensed data will soon be available at 1-meter resolution. Locally-produced data is being registered on the Internet in the clearinghouses of the National Spatial Data Infrastructure, and with GPS and wireless technologies making even more data available, the flood of geospatial data will be spectacular. However, the existence of unimaginable quantities of data does not guarantee that researchers will find needed data more quickly and easily. Just as Coleridge's ancient mariner was thirsty when becalmed in a vast ocean, the current scientists cry could be "Data, data everywhere, but not the information I need." This will be a problem for all kinds of data of interest to scientists, but solutions for geographic information will need focused research efforts. Even if the right data can be found, complex problems need sophisticated tools that may fail to be useful or usable, for reasons noted above.
The visions for 2010 presented in the first part of this report will be difficult to realize if the major research issues and grand research challenges presented in this section are not met head on by a concerted, coordinated program of multidisciplinary research. Such a research program will required the participation of national funding agencies.
Research using GIS has been happening for decades, as has research into basic theories and concepts of computing about geographic space. But, as noted above, current technologies and societal trends are producing great increases in availability of and demand for geographic information systems and services by all sectors of society. The technology and systems will continue to be pushed by military, commercial, and administrative applications, and the research sector is small by comparison. Without investment on behalf of the research community, the full potential of these systems and technologies for scientific use is unlikely to be realized. Funding from the margins of other disciplines and programs is unlikely to provide the kind of base funding that this emerging multidisciplinary field of study requires.
The President's Information Technology Advisory Committee was established to provide the government with guidance and advice on all areas of high performance computing, communications, and information technologies. The Interim PITAC Report submitted in August 1998 noted that under funding of research related to information technologies is a threat to "U.S. leadership in the emerging 21st-century information-based economy." The worldwide market for GIS software and services has been projected to approach $4 billion in 1999, and US companies appear to have more than half of the world market for GIS software. Including services as well as software and data, 1998 market in the U.S. alone is estimated at $4.2 billion. The US GIS industry is very healthy at present, but could stall out in the future without sufficient government support of basic research into the theoretical foundations of Geographic Information Science. Many aspects of current commercial systems are based in innovations developed in the public or academic sectors in the 1960s and 1970s. The PITAC report specifically mentions GIS in a section on socio-economic impacts of an investment in information technology research.
If the PITAC report leads to increased funding for basic research in information science and technology, a non-trivial proportion of those funds should be specifically directed toward Geographic Information Science research. The general public easily understands many aspects of GIS and related technologies, and an investment in this area is likely to produce tangible benefits obvious to many sectors of society. Such investment could also help assure that US industry will continue to lead in this area. GIS software also serves other industries, and is used in data management by all levels of government. The Next Generation Internet (NGI) and the development of a Global Disaster Information Network (GDIN) are other trends that are consistent with the need for further advances in GIScience and related technologies. Mobile information systems with positioning systems will provide new opportunities and challenges for the telecommunications industry, and have positive implications for the broader field of information technology.
GIScience needs a broad, cross-disciplinary coverage and active involvement of researchers in order to make necessary advances. GIScience needs research in theoretical geography, mathematics, cognitive science, and basic computer and information science, as well as in areas of science and engineering where GIS is applied to scientific problems, such as ecology, earth science, social sciences, and other areas. Clearly, GIS and GIScience research should be supported by many federal agencies. However, the National Science Foundation is the only agency in the United States that supports all aspects of the GIScience and GIS, from theoretical topics to scientific applications. The National Science Foundation is well placed to make a difference to scientific and technical progress in Geographic Information Science.
There are parallel initiatives already being planned within the Foundation as well. NSF's Directorate for Geosciences (GEO) is undertaking a major long-range planning effort to develop a vision of the cutting-edge issues for the geosciences during the first decade of the 21st century. One of the driving forces behind this effort, code-named "GEO Beyond 2000," is improved scientific models, and the other is technology:
The second development is the revolutionary increase in the capability of computer information, and sensor technologies. Our current ability to monitor and observe the Earth system on all spatial and temporal scales is unprecedented, and, when coupled with our ever-increasing ability to store and retrieve vast quantities of archived information for detailed examination, provides for much more rapid knowledge generation and dissemination. These rapidly improving technologies will advance the scientific research to provide the sophisticated tools and monitoring systems that policy makers will need to make informed decisions.
If GEO Beyond 2000 goes forward, it could contribute significantly to needs for 'Research using GIS' in the geosciences. However, it is unlikely that it would provide the broad multidisciplinary support for basic GIScience and for research using GIS in the biological and social sciences.
Even in the short time since the workshop, potential support for computer and information science and technology continues to improve. The U.S. Administration's FY 2000 budget includes $366 million of new money for computing and communications technology, to be implemented through several Federal agencies, of which NSF's $146 million is the largest component. NSF's FY 2000 budget states:
NSF has been asked to serve as lead agency for the Administrations FY 2000 Information Technology for the Twenty-First Century (IT
The participants in the present GIScience workshop felt that specific GIScience research is needed to solve geospatial dimensions of information technology, we recommend that a portion of any increased research money for Information technology be directed specifically toward Geographic Information Science.
Funding initiatives such as the one we are recommending are unlikely to be effective in the absence of a community of scholars that provides both potential applicants as well as norms against which proposals would be evaluated. Geographic Information Science in the United States has such a community. Although the term Geographic Information Science was coined only in the early 1990s, it labeled a science and engineering field that had been emerging during the previous two decades as a consequence of technological transitions in the mapping sciences, convergence of spatial analysis methods, and the development of new technologies for collection and processing of geospatial data. In 1994, representatives of 34 US universities and other research organizations met and decided to establish an organization "dedicated to the development and use of theories, methods, technology, and data for understanding geographic processes, relationships, and pattern." Named the University Consortium for Geographic Information Science (UCGIS), the organization has grown to include 50 universities and 4 other organizations as full members, and some of the leading US IT and GIS firms as affiliate members. The University members are estimated to have more than a thousand individual GIS-related researchers and educators, and these are only part of the US academic research community. In 1996, the UCGIS established 10 "National Research Priorities" that have been used to promote research in the field, and in 1997, they determmined 8 "National Education Priorities". Taken together, the UCGIS research priorities cover a rather broad range of specific research problems in GIScience, and thus these priorities have a different granularity from the research areas discussed in this report. Other resources for the field include the National Center for Geographic Information and Analysis and its current Project Varenius ("Advancing Geographic Information Science"), the Open GIS Consortium, the Federal Geographic Data Committee, and a good number of refereed journals and scholarly conferences. The OpenGIS Consortium, established in 1994 to provide a formal structure and process for developing a specification for interoperable geoprocessing, is another indication of a research and development community. Primarily composed of private sector organizations, OpenGIS also includes government and academic participants. Thirdly, in the public sector, the Federal Geographic Data Committee (FGDC) was established by a presidential Order, also in 1994, to support public and private sector applications of geospatial data in such areas as transportation, community development, agriculture, emergency response, environmental management, and information technology; state and local governments are involved through the National States Geographic Information Council (NSGIC) and the National Association of Counties (NACO).
In a climate where research funding is widely acknowledged to be tight, it is difficult to know what level should be devoted to new activities in Geographic Information Science. There are, however, several recent or future tendencies that may provide some basis for such decisions. For example, recently, NSF devoted $50 million per year to the Knowledge and Distributed Intelligence (KDI) initiative. For the future, the PITAC report recommended a diverse portfolio of funding for information technology, including single-investigator efforts, multi-investigator projects, and centers. One of the more novel ideas presented in the PITAC report is "Expeditions into the 21st Century," described as follows:
"Expeditions into the 21st Century" will be virtual centers that bring together scientists, engineers, and computer scientists from academia, government, and industry to "live in the technological future." The mission of these expeditions will be to report back to the Nation what could be accomplished by using technologies that are quantitatively and qualitatively more powerful than those available today.
They recommend that such Expeditions be very well funded:
The full term of an Expedition would be ten years. To encourage truly aggressive efforts, very high annual funding levels should be possible, say up to $40 million per center.
PITAC also recommended " Enabling Technology Centers" at up to $10 million per year.
The workshop participants believe that investment on that order of magnitude is needed to have a real influence on amount of basic GIScience research conducted and the number of disciplines now using GIS and geospatial methods in their research. Considering the size of the US GIS industry and of government annual expenditures on GIS, GIScience research funding on the order of $40 million per year is readily justified. NSF has available many models for administering and funding research, ranging from individual proposals and programs to cross-Directorate initiatives and Centers. Educational initiatives should not be ignored, nor should physical infrastructure such as computing hardware or buildings to house GIScience research activities. Geographic information pervades, or should pervade, all computing about the environment and society, for administration, management, planning, disaster response, and business as well as applied research. GIScience provides the basic intellectual underpinnings for geographic information technologies, and GIScience research should be supported at levels appropriate to the importance of these technologies and their application.
The workshop participants make the following recommendations to the National Science Foundation:
Last updated on March 1 1999