Search Results

Now showing 1 - 7 of 7
  • Item
    Using Bayesian Networks to Investigate the Influence of Subseasonal Arctic Variability on Midlatitude North Atlantic Circulation
    (Boston, Mass. [u.a.] : AMS, 2021) Harwood, Nathanael; Hall, Richard; Di Capua, Giorgia; Russell, Andrew; Tucker, Allan
    Recent enhanced warming and sea ice depletion in the Arctic have been put forward as potential drivers of severe weather in the midlatitudes. Evidence of a link between Arctic warming and midlatitude atmospheric circulation is growing, but the role of Arctic processes relative to other drivers remains unknown. Arctic–midlatitude connections in the North Atlantic region are particularly complex but important due to the frequent occurrence of severe winters in recent decades. Here, dynamic Bayesian networks with hidden variables are introduced to the field to assess their suitability for teleconnection analyses. Climate networks are constructed to analyze North Atlantic circulation variability at 5-day to monthly time scales during the winter months of the years 1981–2018. The inclusion of a number of Arctic, midlatitude, and tropical variables allows for an investigation into the relative role of Arctic influence compared to internal atmospheric variability and other remote drivers. A robust covariability between regions of amplified Arctic warming and two definitions of midlatitude circulation is found to occur entirely within winter at submonthly time scales. Hidden variables incorporated in networks represent two distinct modes of stratospheric polar vortex variability, capturing a periodic shift between average conditions and slower anomalous flow. The influence of the Barents–Kara Seas region on the North Atlantic Oscillation is found to be the strongest link at 5- and 10-day averages, while the stratospheric polar vortex strongly influences jet variability on monthly time scales.
  • Item
    Topology of products similarity network for market forecasting
    ([Cham] : Springer International Publishing, 2019) Fan, Jingfang; Cohen, Keren; Shekhtman, Louis M.; Liu, Sibo; Meng, Jun; Louzoun, Yoram; Havlin, Shlomo
    The detection and prediction of risk in financial markets is one of the main challenges of economic forecasting, and draws much attention from the scientific community. An even more challenging task is the prediction of the future relative gain of companies. We here develop a novel combination of product text analysis, network theory and topological based machine learning to study the future performance of companies in financial markets. Our network links are based on the similarity of firms’ products and constructed using the Securities Exchange Commission (SEC) filings of US listed firms. We find that several topological features of this network can serve as good precursors of risks or future gain of companies. We then apply machine learning to network attributes vectors for each node to predict successful and failing firms. The resulting accuracies are much better than current state of the art techniques. The framework presented here not only facilitates the prediction of financial markets but also provides insight and demonstrates the power of combining network theory and topology based machine learning.
  • Item
    Master Memory Function for Delay-Based Reservoir Computers With Single-Variable Dynamics
    ([New York, NY] : IEEE, 2022) Köster, Felix; Yanchuk, Serhiy; Lüdge, Kathy
    We show that many delay-based reservoir computers considered in the literature can be characterized by a universal master memory function (MMF). Once computed for two independent parameters, this function provides linear memory capacity for any delay-based single-variable reservoir with small inputs. Moreover, we propose an analytical description of the MMF that enables its efficient and fast computation. Our approach can be applied not only to single-variable delay-based reservoirs governed by known dynamical rules, such as the Mackey–Glass or Stuart–Landau-like systems, but also to reservoirs whose dynamical model is not available.
  • Item
    Statistical stopping criteria for automated screening in systematic reviews
    (London : Biomed Central, 2020) Callaghan, Max W.; Müller-Hansen, Finn
    Active learning for systematic review screening promises to reduce the human effort required to identify relevant documents for a systematic review. Machines and humans work together, with humans providing training data, and the machine optimising the documents that the humans screen. This enables the identification of all relevant documents after viewing only a fraction of the total documents. However, current approaches lack robust stopping criteria, so that reviewers do not know when they have seen all or a certain proportion of relevant documents. This means that such systems are hard to implement in live reviews. This paper introduces a workflow with flexible statistical stopping criteria, which offer real work reductions on the basis of rejecting a hypothesis of having missed a given recall target with a given level of confidence. The stopping criteria are shown on test datasets to achieve a reliable level of recall, while still providing work reductions of on average 17%. Other methods proposed previously are shown to provide inconsistent recall and work reductions across datasets.
  • Item
    Remote Sensing Based Yield Estimation of Rice (Oryza Sativa L.) Using Gradient Boosted Regression in India
    (Basel : MDPI, 2021) Arumugam, Ponraj; Chemura, Abel; Schauberger, Bernhard; Gornott, Christoph
    Accurate and spatially explicit yield information is required to ensure farmers’ income and food security at local and national levels. Current approaches based on crop cutting experiments are expensive and usually too late for timely income stabilization measures like crop insurances. We, therefore, utilized a Gradient Boosted Regression (GBR), a machine learning technique, to estimate rice yields at ~500 m spatial resolution for rice-producing areas in India with potential application for near real-time estimates. We used resampled intermediate resolution (~5 km) images of the Moderate Resolution Imaging Spectroradiometer (MODIS) Leaf Area Index (LAI) and observed yields at the district level in India for calibrating GBR models. These GBRs were then used to downscale district yields to 500 m resolution. Downscaled yields were re-aggregated for validation against out-of-sample district yields not used for model training and an additional independent data set of block-level (below district-level) yields. Our downscaled and re-aggregated yields agree well with reported district-level observations from 2003 to 2015 (r = 0.85 & MAE = 0.15 t/ha). The model performance improved further when estimating separate models for different rice cropping densities (up to r = 0.93). An additional out-of-sample validation for the years 2016 and 2017, proved successful with r = 0.84 and r = 0.77, respectively. Simulated yield accuracy was higher in water-limited, rainfed agricultural systems. We conclude that this downscaling approach of rice yield estimation using GBR is feasible across India and may complement current approaches for timely rice yield estimation required by insurance companies and government agencies.
  • Item
    Estimation of hourly near surface air temperature across Israel using an ensemble model
    (Basel : MDPI, 2020) Zhou, Bin; Erell, Evyatar; Hough, Ian; Shtein, Alexandra; Just, Allan C.; Novack, Victor; Rosenblatt, Jonathan; Kloog, Itai
    Mapping of near-surface air temperature (Ta) at high spatio-temporal resolution is essential for unbiased assessment of human health exposure to temperature extremes, not least given the observed trend of urbanization and global climate change. Data constraints have led previous studies to focus merely on daily Ta metrics, rather than hourly ones, making them insufficient for intra-day assessment of health exposure. In this study, we present a three-stage machine learning-based ensemble model to estimate hourly Ta at a high spatial resolution of 1 × 1 km2, incorporating remotely sensed surface skin temperature (Ts) from geostationary satellites, reanalysis synoptic variables, and observations from weather stations, as well as auxiliary geospatial variables, which account for spatio-temporal variability of Ta. The Stage 1 model gap-fills hourly Ts at 4 × 4 km2 from the Spinning Enhanced Visible and InfraRed Imager (SEVIRI), which are subsequently fed into the Stage 2 model to estimate hourly Ta at the same spatio-temporal resolution. The Stage 3 model downscales the residuals between estimated and measured Ta to a grid of 1 × 1 km2, taking into account additionally the monthly diurnal pattern of Ts derived from the Moderate Resolution Imaging Spectroradiometer (MODIS) data. In each stage, the ensemble model synergizes estimates from the constituent base learners—random forest (RF) and extreme gradient boosting (XGBoost)—by applying a geographically weighted generalized additive model (GAM), which allows the weights of results from individual models to vary over space and time. Demonstrated for Israel for the period 2004–2017, the proposed ensemble model outperformed each of the two base learners. It also attained excellent five-fold cross-validated performance, with overall root mean square error (RMSE) of 0.8 and 0.9 °C, mean absolute error (MAE) of 0.6 and 0.7 °C, and R2 of 0.95 and 0.98 in Stage 1 and Stage 2, respectively. The Stage 3 model for downscaling Ta residuals to 1 km MODIS grids achieved overall RMSE of 0.3 °C, MAE of 0.5 °C, and R2 of 0.63. The generated hourly 1 × 1 km2 Ta thus serves as a foundation for monitoring and assessing human health exposure to temperature extremes at a larger geographical scale, helping to further minimize exposure misclassification in epidemiological studies.
  • Item
    Supermodeling by combining imperfect models
    (Amsterdam : Elsevier B.V., 2011) Selten, F.M.; Duane, G.S.; Wiegerinck, W.; Keenlyside, N.; Kurths, J.; Kocarev, L.
    SUMO (Supermodeling by combining imperfect models) is a three-year project funded under FET Open program with a starting date October, 1, 2010. We review some basic facts and findings of the SUMO project.