Difference between revisions of "DataFed"

From Earth Science Information Partners (ESIP)
Line 10: Line 10:
 
|DataSystemCoverage=About half of the datasets are global scale, a third are US-scale, while some datasets are for other regions. Most datasets are multi-year in extent. About a ten datasets are near-real-time.
 
|DataSystemCoverage=About half of the datasets are global scale, a third are US-scale, while some datasets are for other regions. Most datasets are multi-year in extent. About a ten datasets are near-real-time.
 
|DataSystemAppHealth=No applications to health studies. However, the datasets mediated through DataFed are suitable for health studies, particularly in conjunction with the 1km-resolution US population data.
 
|DataSystemAppHealth=No applications to health studies. However, the datasets mediated through DataFed are suitable for health studies, particularly in conjunction with the 1km-resolution US population data.
|DataSystemAppFcstReAnaly=A current NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS] uses DataFed to assimilate surface obs. into a forecast model. We are not aware of any formal Air Quality Reanalysis effort; hopefully, thee community will
+
|DataSystemAppFcstReAnaly=A current NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS] uses DataFed to assimilate surface obs. into a forecast model. We are not aware of any formal Air Quality Reanalysis effort; hopefully, thee community will
 
|DataSystemAppModelEval=The [http://capita.wustl.edu/NEISGEI/main.html EPA NEISGEI Project] uses DataFed to integrate and to evaluate multiple emission databases. DataFed was used to prepare an [http://capita.wustl.edu/models3eval/IMPComp/FinalReporModels3.htm evaluation of the CMAQ Aerosol Model with IMPROVE and FRM data]. In the NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS], DataFed provides surface observations for assimilation into a forecast model. Add HTAP data integration...
 
|DataSystemAppModelEval=The [http://capita.wustl.edu/NEISGEI/main.html EPA NEISGEI Project] uses DataFed to integrate and to evaluate multiple emission databases. DataFed was used to prepare an [http://capita.wustl.edu/models3eval/IMPComp/FinalReporModels3.htm evaluation of the CMAQ Aerosol Model with IMPROVE and FRM data]. In the NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS], DataFed provides surface observations for assimilation into a forecast model. Add HTAP data integration...
 
|DataSystemAppCharact=DataFed was used to perform aerosol characterization for the RPO project, [http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET]. DataFed is the main data source supporting the development of [http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project EPA's Exceptional Event Rule]. It is now used in the [[Exceptional_Air_Pollution_Event_Analysis_Community_Workspace| implementation of the EE Rule.]]
 
|DataSystemAppCharact=DataFed was used to perform aerosol characterization for the RPO project, [http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET]. DataFed is the main data source supporting the development of [http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project EPA's Exceptional Event Rule]. It is now used in the [[Exceptional_Air_Pollution_Event_Analysis_Community_Workspace| implementation of the EE Rule.]]

Revision as of 10:43, January 29, 2008

<Back to Data Summit Workspace <All Data Systems
Edit with Form or Submit Word Doc

General

Contact

Data System Name: DataFed
Data System URL: http://datafedwiki.wustl.edu
Contact Person: Rudy Husar
Contact e-mail: rhusar@me.wustl.edu

Background

About the Data System (Purposes, Audience)

DataFed is Web services-based software that non-intrusively mediates between autonomous, distributed data providers and users. DataFed is designed in accordance with the GEOSS architecture; It provides standard interfaces to heterogeneous distributed data, fosters data integration and use with processing web services and tools, and collects metadata and user-feedback on datasets. DataFed also provides standards-based data feeds to the NASA Giovanni System.

Presentation

Not Given

History

[[DataSystemHistory::The federated data system, DataFed, was in development since 2001 at Washington University, CAPITA, with grants from NSF, NASA, EPA and RPOs. Since 2004, DataFed served both Regulatory and Policy support to EPA. Within CAPITA, DataFed has become a scientific data analysis tool.]]

Agencies

Washington University

List of Publications, Papers, Presentations

Data System Scope

Data Content

Datasets Served

Not Given

Parameters

[[DataSystemParam::DataFed provides access to over 100+ distributed, air-quality relevant datasets (surface, satellite, and model) which can be explored and analyzed by tools for processing and visualization.]]

Spatial - Temporal Coverage

About half of the datasets are global scale, a third are US-scale, while some datasets are for other regions. Most datasets are multi-year in extent. About a ten datasets are near-real-time.

Applications/Potential


Health

No applications to health studies. However, the datasets mediated through DataFed are suitable for health studies, particularly in conjunction with the 1km-resolution US population data.

Forecasting and Reanalysis

[[DataSystemAppFcstReAnaly::A current NASA project with BAMS uses DataFed to assimilate surface obs. into a forecast model. We are not aware of any formal Air Quality Reanalysis effort; hopefully, thee community will]]

Model/Emissions Evaluation

[[DataSystemAppModelEval::The EPA NEISGEI Project uses DataFed to integrate and to evaluate multiple emission databases. DataFed was used to prepare an evaluation of the CMAQ Aerosol Model with IMPROVE and FRM data. In the NASA project with BAMS, DataFed provides surface observations for assimilation into a forecast model. Add HTAP data integration...]]

Characterization, Trends, Accountability

[[DataSystemAppCharact::DataFed was used to perform aerosol characterization for the RPO project, FASTNET. DataFed is the main data source supporting the development of EPA's Exceptional Event Rule. It is now used in the implementation of the EE Rule.]]

Other

[[DataSystemAppOther::Since 2004, a major role of DataFed was to participate in interoperability experiments for GEOSS.]]

Data System IT

Primary/Official Store for Some data

Not Given

Data Consolidation/integration

Not Given

Providing Data Access to users/externals

[[DataSystemValueAccess::DataFed is a homogenizer of distributed, heterogeneous datasets through data 'wrappers'. As a result all the data mediated in DataFed are accessible through international standard data access services, OGC WCS and WMS. At this time all data access services are free and offered through an open interface.]]

Data Processing

The processing of raw data is performed by reusable web-service components, which include filtering, aggregation, and data fusion services. Data processing applications are created by chaining services using workflow software.

Visualization/Analysis

[[DataSystemValueVis::The visualization tools for parameter-spatial-temporal browsing are applicable for each dataset in the federated data system. The output data from the processing services are also available for mashups with other popular tools e.g. Google Earth and GIS software.]]

Decision Support (e.g. some integration into user business process)

Not Given

End-to-End Integration

[[EndtoEndIntegration::Data access, processing and visualization are all performed within DataFed. Specific workflow configurations are created from the loosely coupled web services for different scientific analysis or decision-support applications. An example custom workflow is the Combined Aerosol Trajectory Tool (CATT).]]

Other DS Values

Not Given

Data Access and/or Output Interoperability

[[DataSystemArchInterop::Both the raw input data as well as the processed outputs are accessible through international standard interfaces. This allows the creation of loosely-coupled network applications (Paper-PDF).]]

Reusable Tools and Methods

[[DataSystemArchToolsMethods::The data access, processing and visualization services in DataFed are all composed of reusable Web Services through both SOAP and REST protocols (Paper-PDF).]]

Security Barriers and Solutions

The data access and processing services are accessible through the SOAP-WSDL protocol, which is designed to pass through firewalls. At this time there are no access restrictions to these services.

User Feedback Approach

[[DataSystemArchUserFeedbck::For each dataset registered in DataFed there is a "DataSpace" wiki page for the collection of dataset-relavent information, including user feedback (e.g. AirNOW).]]

Other Architecture

[[DataSystemArchOther::The DataFed architecture has been used as a model for demonstrating the "System of Systems" aspect of GEOSS.]]

User Provided Content