DataFed

From Federation of Earth Science Information Partners

< Back to all Air Quality Data Providers | Edit with Form

Contact

Provider Abbreviation: DataFed
Provider URL: http://datafed.net
Location:: 38.65, -90.30
Networks: HTAP, GEOSS, ACPortal


50px

About the Data Provider (Purposes, Audience):


<Back to Data Summit Workspace <All Data Systems
Edit with Form or Submit Word Doc

General

Contact

Data System Name: DataFed
Data System URL: http://datafedwiki.wustl.edu
Contact Person: Rudy Husar
Contact e-mail: [email protected]

Background

About the Data System (Purposes, Audience):

DataFed is Web services-based software that non-intrusively mediates between autonomous, distributed data providers and users. DataFed is designed in accordance with the GEOSS architecture; It provides standard interfaces to heterogeneous distributed data, fosters data integration and use with processing web services and tools, and collects metadata and user-feedback on datasets. DataFed also provides standards-based data feeds to the NASA Giovanni System.Property "About" (as page type) with input value "DataFed is Web services-based software that non-intrusively mediates between autonomous, distributed data providers and users. DataFed is designed in accordance with the GEOSS architecture; It provides standard interfaces to heterogeneous distributed data, fosters data integration and use with processing web services and tools, and collects metadata and user-feedback on datasets. DataFed also provides standards-based data feeds to the NASA Giovanni System." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Presentation:

Not Given

History:

The federated data system, DataFed, was in development since 2001 at Washington University, CAPITA, with grants from [http://capita.wustl.edu/capita/researchareas/NSFPropSubm/FinalReport/FinalReport_050629.pdf NSF], [http://capita.wustl.edu/capita/researchareas/NASAReason/ESEPMNASAReasonAbstr.htm NASA], [http://capita.wustl.edu/NEISGEI/main.html EPA] and [http://datafedwiki.wustl.edu/images/c/c4/EM_DataFed_FASTNET_050720.pdf RPOs]. Since 2004, DataFed served both Regulatory and Policy support to EPA. Within CAPITA, DataFed has become a scientific data analysis tool.Property "DataSystemHistory" (as page type) with input value "The federated data system, DataFed, was in development since 2001 at Washington University, CAPITA, with grants from NSF, NASA, EPA and RPOs. Since 2004, DataFed served both Regulatory and Policy support to EPA. Within CAPITA, DataFed has become a scientific data analysis tool." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Agencies:

Washington University

List of Publications, Papers, Presentations:

http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET],[http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project EPA Exceptional Event Project], [http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project Interoperability of Web Service-Based Data Access and Processing...] ESTO 2006, [http://datafedwiki.wustl.edu/index.php/CATT Combined Aerosol Trajectory Tool, CATT], [http://capita.wustl.edu/capita/capitareports/070725IGARSS07_Barcelona/DataFed_IGARSS07_Barcelona_ER3.pdf DataFed: Mediated Web Services for Distributed AQ Data Access and Processing] IGARS 2007, [http://datafedwiki.wustl.edu/index.php/2007-01-25_HTAP_Mtg_Geneva Interoperable Info System of Systems for HTAP] HTAP 2007Property "DataSystemRef" (as page type) with input value "http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET],EPA Exceptional Event Project, Interoperability of Web Service-Based Data Access and Processing... ESTO 2006, Combined Aerosol Trajectory Tool, CATT, DataFed: Mediated Web Services for Distributed AQ Data Access and Processing IGARS 2007, Interoperable Info System of Systems for HTAP HTAP 2007" contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Data System Scope

Data Content


Datasets Served:

http://datafedwiki.wustl.edu/index.php/Compact_Catalog_-_Alphabetical See Dataset Catalog

Parameters:

DataFed provides access to over [http://datafedwiki.wustl.edu/index.php/Compact_Catalog_-_Alphabetical 100+] distributed, air-quality relevant datasets (surface, satellite, and model) which can be explored and analyzed by tools for processing and visualization.Property "DataSystemParam" (as page type) with input value "DataFed provides access to over 100+ distributed, air-quality relevant datasets (surface, satellite, and model) which can be explored and analyzed by tools for processing and visualization." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Spatial - Temporal Coverage:

About half of the datasets are global scale, a third are US-scale, while some datasets are for other regions. Most datasets are multi-year in extent. About a ten datasets are near-real-time.

Applications/Potential


Health:

No applications to health studies. However, the datasets mediated through DataFed are suitable for health studies, particularly in conjunction with the 1km-resolution US population data.

Forecasting and Reanalysis:

A current NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS] uses DataFed to assimilate surface obs. into a forecast model. We are not aware of any formal Air Quality Reanalysis effort; hopefully, thee community willProperty "DataSystemAppFcstReAnaly" (as page type) with input value "A current NASA project with BAMS uses DataFed to assimilate surface obs. into a forecast model. We are not aware of any formal Air Quality Reanalysis effort; hopefully, thee community will" contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Model/Emissions Evaluation:

The [http://capita.wustl.edu/NEISGEI/main.html EPA NEISGEI Project] uses DataFed to integrate and to evaluate multiple emission databases. DataFed was used to prepare an [http://capita.wustl.edu/models3eval/IMPComp/FinalReporModels3.htm evaluation of the CMAQ Aerosol Model with IMPROVE and FRM data]. In the NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS], DataFed provides surface observations for assimilation into a forecast model. Add HTAP data integration...Property "DataSystemAppModelEval" (as page type) with input value "The EPA NEISGEI Project uses DataFed to integrate and to evaluate multiple emission databases. DataFed was used to prepare an evaluation of the CMAQ Aerosol Model with IMPROVE and FRM data. In the NASA project with BAMS, DataFed provides surface observations for assimilation into a forecast model. Add HTAP data integration..." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Characterization, Trends, Accountability:

DataFed was used to perform aerosol characterization for the RPO project, [http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET]. DataFed is the main data source supporting the development of [http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project EPA's Exceptional Event Rule]. It is now used in the [[Exceptional_Air_Pollution_Event_Analysis_Community_Workspace| implementation of the EE Rule.Property "DataSystemAppCharact" (as page type) with input value "DataFed was used to perform aerosol characterization for the RPO project, FASTNET. DataFed is the main data source supporting the development of EPA's Exceptional Event Rule. It is now used in the [[Exceptional_Air_Pollution_Event_Analysis_Community_Workspace| implementation of the EE Rule." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Other:

Since 2004, a major role of DataFed was to participate in interoperability experiments for GEOSS.Property "DataSystemAppOther" (as page type) with input value "Since 2004, a major role of DataFed was to participate in interoperability experiments for GEOSS." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Data System IT

Primary/Official Store for Some data:

DataFed is a mediator of data flow between providers and users. It does not primary/official data.

Data Consolidation/integration:

Data consolidation from heterogeneous to homogeneous structure is performed on the fly for most datasets. Many historical datasets are cached at DataFed for fast data access and browsing.

Providing Data Access to users/externals:

DataFed is a homogenizer of distributed, heterogeneous datasets through [http://esto.nasa.gov/conferences/ESTC2006/papers/a6p2.pdf data 'wrappers']. As a result all the data mediated in DataFed are accessible through international [http://esto.nasa.gov/conferences/ESTC2006/papers/a6p2.pdf standard data access services, OGC WCS and WMS]. At this time all data access services are free and offered through an open interface.Property "DataSystemValueAccess" (as page type) with input value "DataFed is a homogenizer of distributed, heterogeneous datasets through data 'wrappers'. As a result all the data mediated in DataFed are accessible through international standard data access services, OGC WCS and WMS. At this time all data access services are free and offered through an open interface." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Data Processing:

The processing of raw data is performed by reusable web-service components, which include filtering, aggregation, and data fusion services. Data processing applications are created by chaining services using workflow software.

Visualization/Analysis:

The visualization tools for parameter-spatial-temporal browsing are applicable for each dataset in the federated data system. The output data from the processing services are also available for mashups with other popular tools e.g. [http://datafedwiki.wustl.edu/index.php/2007-07-18_ESIP_Demo_OMI_NO2 Google Earth] and GIS software.Property "DataSystemValueVis" (as page type) with input value "The visualization tools for parameter-spatial-temporal browsing are applicable for each dataset in the federated data system. The output data from the processing services are also available for mashups with other popular tools e.g. Google Earth and GIS software." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Decision Support (e.g. some integration into user business process):

DataFed has served the RPOs through the [http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET] and [http://datafedwiki.wustl.edu/index.php/CATT Combined Aerosol Trajectory Tool (CATT)] projects. More recently DataFed supports the decisions for the Exceptional Event Rule for PM2.5 and ozone.Property "DataSystemValueDecisionSupport" (as page type) with input value "DataFed has served the RPOs through the FASTNET and Combined Aerosol Trajectory Tool (CATT) projects. More recently DataFed supports the decisions for the Exceptional Event Rule for PM2.5 and ozone." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

End-to-End Integration:

Data access, processing and visualization are all performed within DataFed. Specific workflow configurations are created from the loosely coupled web services for different scientific analysis or decision-support applications. An example custom workflow is the [http://datafedwiki.wustl.edu/index.php/CATT Combined Aerosol Trajectory Tool (CATT)].Property "EndtoEndIntegration" (as page type) with input value "Data access, processing and visualization are all performed within DataFed. Specific workflow configurations are created from the loosely coupled web services for different scientific analysis or decision-support applications. An example custom workflow is the Combined Aerosol Trajectory Tool (CATT)." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Other DS Values:

Not Given

Data Access and/or Output Interoperability:

Both the raw input data as well as the processed outputs are accessible through international standard interfaces. This allows the creation of [http://datafedwiki.wustl.edu/index.php/2007-07-25_IGARSS07_Barcelona loosely-coupled network applications] ([http://capita.wustl.edu/capita/capitareports/070725IGARSS07_Barcelona/DataFed_IGARSS07_Barcelona_ER3.pdf Paper-PDF]).Property "DataSystemArchInterop" (as page type) with input value "Both the raw input data as well as the processed outputs are accessible through international standard interfaces. This allows the creation of loosely-coupled network applications (Paper-PDF)." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Reusable Tools and Methods:

The data access, processing and visualization services in DataFed are all composed of reusable Web Services through both SOAP and REST protocols ([http://capita.wustl.edu/capita/capitareports/070725IGARSS07_Barcelona/DataFed_IGARSS07_Barcelona_ER3.pdf Paper-PDF]).Property "DataSystemArchToolsMethods" (as page type) with input value "The data access, processing and visualization services in DataFed are all composed of reusable Web Services through both SOAP and REST protocols (Paper-PDF)." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Security Barriers and Solutions:

The data access and processing services are accessible through the SOAP-WSDL protocol, which is designed to pass through firewalls. At this time there are no access restrictions to these services.

User Feedback Approach:

For each dataset registered in DataFed there is a "DataSpace" wiki page for the collection of dataset-relavent information, including user feedback (e.g. AirNOW).Property "DataSystemArchUserFeedbck" (as page type) with input value "For each dataset registered in DataFed there is a "DataSpace" wiki page for the collection of dataset-relavent information, including user feedback (e.g. AirNOW)." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

Other Architecture:

The DataFed architecture has been used as a model for demonstrating the [http://datafedwiki.wustl.edu/index.php/2007-01-25_HTAP_Mtg_Geneva "System of Systems" aspect of GEOSS].Property "DataSystemArchOther" (as page type) with input value "The DataFed architecture has been used as a model for demonstrating the "System of Systems" aspect of GEOSS." contains invalid characters or is incomplete and therefore can cause unexpected results during a query or annotation process.

User Provided Content

HTAPEEACC