XCast: A Gridpoint-Wise Statistical Modeling Library for the Earth Sciences
XCast is a free and open source (passion) project by Kyle Hall & Nachiketa Acharya, designed to help Earth Scientists scale single-point-in-space regression approaches to spatial gridded data using the popular Earth Science data tool, Xarray. XCast is designed to be high-performance, intuitive, and easily extensible. It is our hope that XCast will serve to bridge the gap between the two-dimensional world of Python Data Science (Samples x Features), and the four-dimensional world of climate data (Samples x Features x Latitude x Longitude).Explore the docs (Our Website) »
Report Bug · Request Feature
Table of Contents
Why XCast?
Numerous problems in the Earth Sciences are solved by finding statistical relationships (multiple-regression type) between variables measured at a given point in space, across time. Often, it’s desirable to apply these approaches at many points in space, on a ‘gridpoint-wise’ basis. While Python has numerous statistical and machine learning libraries, none are designed to accomodate fitting more than one statistical model at once, i.e., at many points in space, as is required by this gridpoint-wise approach.
XCast enables users to apply Python’s various statistical tools to spatial gridded data on a gridpoint-wise basis, without having to manually track and manage different dimensions, lists of model instances, or metadata. Built on Xarray and Dask, two powerful data science libraries, XCast is capable of analyzing “Big-Data” that won’t fit in RAM, and can be scaled to supercomputer clusters. It is designed to be extended to accomodate new statistical libraries easily, and to maximize synergy with the PanGEO stack and other Earth Science data analytics packages like XClim, ClimPred, and XSkillScore.
License
Distributed under the MIT License. See LICENSE
for more information.
Contact
Email: kjhall@iri.columbia.edu (This is a side project, so it may take a while to get back to you)