Difference between revisions of "DataFed"

From Earth Science Information Partners (ESIP)
(Removing all content from page)
Line 1: Line 1:
{{DataSystemProfile
+
 
|DataSystemName=DataFed
 
|DataSystemURL=http://datafedwiki.wustl.edu
 
|ContactName=Rudy Husar
 
|Contacte-mail=rhusar@me.wustl.edu
 
|About=DataFed is Web services-based software that non-intrusively mediates between autonomous, distributed data providers and users. DataFed is designed in accordance with the GEOSS architecture; It provides standard interfaces to heterogeneous distributed data, fosters data integration and use with processing web services and tools, and collects metadata and user-feedback on datasets. DataFed also provides standards-based data feeds to the NASA Giovanni System.
 
|DataSystemHistory=The federated data system, DataFed, was in development since 2001 at Washington University, CAPITA, with grants from [http://capita.wustl.edu/capita/researchareas/NSFPropSubm/FinalReport/FinalReport_050629.pdf NSF], [http://capita.wustl.edu/capita/researchareas/NASAReason/ESEPMNASAReasonAbstr.htm NASA], [http://capita.wustl.edu/NEISGEI/main.html EPA] and [http://datafedwiki.wustl.edu/images/c/c4/EM_DataFed_FASTNET_050720.pdf RPOs]. Since 2004, DataFed served both Regulatory and Policy support to EPA. Within CAPITA, DataFed has become a scientific data analysis tool.
 
|DataSystemAgencies=Washington University
 
|DataSystemRef=[http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET],[http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project EPA Exceptional Event Project], [http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project Interoperability of Web Service-Based Data Access and Processing...] ESTO 2006, [http://datafedwiki.wustl.edu/index.php/CATT Combined Aerosol Trajectory Tool, CATT], [http://capita.wustl.edu/capita/capitareports/070725IGARSS07_Barcelona/DataFed_IGARSS07_Barcelona_ER3.pdf DataFed: Mediated Web Services for Distributed AQ Data Access and Processing] IGARS 2007, [http://datafedwiki.wustl.edu/index.php/2007-01-25_HTAP_Mtg_Geneva Interoperable Info System of Systems for HTAP] HTAP 2007
 
|DataSystemDataSets=[http://datafedwiki.wustl.edu/index.php/Compact_Catalog_-_Alphabetical See Dataset Catalog]
 
|DataSystemParam=DataFed provides access to over [http://datafedwiki.wustl.edu/index.php/Compact_Catalog_-_Alphabetical 100+] distributed, air-quality relevant datasets (surface, satellite, and model) which can be explored and analyzed by tools for processing and visualization.
 
|DataSystemCoverage=About half of the datasets are global scale, a third are US-scale, while some datasets are for other regions. Most datasets are multi-year in extent. About a ten datasets are near-real-time.
 
|DataSystemAppHealth=No applications to health studies. However, the datasets mediated through DataFed are suitable for health studies, particularly in conjunction with the 1km-resolution US population data.
 
|DataSystemAppFcstReAnaly=A current NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS] uses DataFed to assimilate surface obs. into a forecast model. We are not aware of any formal Air Quality Reanalysis effort; hopefully, thee community will
 
|DataSystemAppModelEval=The [http://capita.wustl.edu/NEISGEI/main.html EPA NEISGEI Project] uses DataFed to integrate and to evaluate multiple emission databases. DataFed was used to prepare an [http://capita.wustl.edu/models3eval/IMPComp/FinalReporModels3.htm evaluation of the CMAQ Aerosol Model with IMPROVE and FRM data]. In the NASA project with [http://groups.google.com/group/nasa-aq-forecast BAMS], DataFed provides surface observations for assimilation into a forecast model. Add HTAP data integration...
 
|DataSystemAppCharact=DataFed was used to perform aerosol characterization for the RPO project, [http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET]. DataFed is the main data source supporting the development of [http://datafedwiki.wustl.edu/index.php/2005/6_Exceptional_Events_Project EPA's Exceptional Event Rule]. It is now used in the [[Exceptional_Air_Pollution_Event_Analysis_Community_Workspace| implementation of the EE Rule.]]
 
|DataSystemAppOther=Since 2004, a major role of DataFed was to participate in [[EPA_Air_Quality_Data_Systems_and_GEOSS_Architecture| interoperability experiments for GEOSS]].
 
|PrimaryDataStorage=DataFed is a mediator of data flow between providers and users. It does not primary/official data.
 
|DataSystemValueConsolidation=Data consolidation from heterogeneous to homogeneous structure is performed on the fly for most datasets. Many historical datasets are cached at DataFed for fast data access and browsing.
 
|DataSystemValueAccess=DataFed is a homogenizer of distributed, heterogeneous datasets through [http://esto.nasa.gov/conferences/ESTC2006/papers/a6p2.pdf data 'wrappers']. As a result all the data mediated in DataFed are accessible through international [http://esto.nasa.gov/conferences/ESTC2006/papers/a6p2.pdf standard data access services, OGC WCS and WMS]. At this time all data access services are free and offered through an open interface.
 
|DataSystemValueProcess=The processing of raw data is performed by reusable web-service components, which include filtering, aggregation, and data fusion services. Data processing applications are created by chaining services using workflow software.
 
|DataSystemValueVis=The visualization tools for parameter-spatial-temporal browsing are applicable for each dataset in the federated data system. The output data from the processing services are also available for mashups with other popular tools e.g.  [http://datafedwiki.wustl.edu/index.php/2007-07-18_ESIP_Demo_OMI_NO2 Google Earth] and GIS software.
 
|DataSystemValueDecisionSupport=DataFed has served the RPOs through the [http://datafedwiki.wustl.edu/index.php/FASTNET FASTNET] and [http://datafedwiki.wustl.edu/index.php/CATT Combined Aerosol Trajectory Tool (CATT)] projects. More recently DataFed supports the decisions for the Exceptional Event Rule for PM2.5 and ozone.
 
|EndtoEndIntegration=Data access, processing and visualization are all performed within DataFed. Specific workflow configurations are created from the loosely coupled web services for different scientific analysis or decision-support applications. An example custom workflow is the [http://datafedwiki.wustl.edu/index.php/CATT Combined Aerosol Trajectory Tool (CATT)].
 
|DataSystemArchInterop=Both the raw input data as well as the processed outputs are accessible through international standard interfaces. This allows the creation of [http://datafedwiki.wustl.edu/index.php/2007-07-25_IGARSS07_Barcelona loosely-coupled network applications] ([http://capita.wustl.edu/capita/capitareports/070725IGARSS07_Barcelona/DataFed_IGARSS07_Barcelona_ER3.pdf Paper-PDF]).
 
|DataSystemArchToolsMethods=The data access, processing and visualization services in DataFed are all composed of reusable Web Services through both SOAP and REST protocols ([http://capita.wustl.edu/capita/capitareports/070725IGARSS07_Barcelona/DataFed_IGARSS07_Barcelona_ER3.pdf Paper-PDF]).
 
|DataSystemArchSecurity=The data access and processing services are accessible through the SOAP-WSDL protocol, which is designed to pass through firewalls. At this time there are no access restrictions to these services.
 
|DataSystemArchUserFeedbck=For each dataset registered in DataFed there is a "DataSpace" wiki page for the collection of dataset-relavent information, including user feedback (e.g. [[AIRNOW|AirNOW]]).
 
|DataSystemArchOther=The DataFed architecture has been used as a model for demonstrating the [http://datafedwiki.wustl.edu/index.php/2007-01-25_HTAP_Mtg_Geneva "System of Systems" aspect of GEOSS].
 
}}
 

Revision as of 17:31, February 5, 2008