Search Results Heading

MBRLSearchResults

mbrl.module.common.modules.added.book.to.shelf
Title added to your shelf!
View what I already have on My Shelf.
Oops! Something went wrong.
Oops! Something went wrong.
While trying to add the title to your shelf something went wrong :( Kindly try again later!
Are you sure you want to remove the book from the shelf?
Oops! Something went wrong.
Oops! Something went wrong.
While trying to remove the title from your shelf something went wrong :( Kindly try again later!
    Done
    Filters
    Reset
  • Discipline
      Discipline
      Clear All
      Discipline
  • Is Peer Reviewed
      Is Peer Reviewed
      Clear All
      Is Peer Reviewed
  • Item Type
      Item Type
      Clear All
      Item Type
  • Subject
      Subject
      Clear All
      Subject
  • Year
      Year
      Clear All
      From:
      -
      To:
  • More Filters
      More Filters
      Clear All
      More Filters
      Source
    • Language
600 result(s) for "dummy variable"
Sort by:
Improving Aboveground Biomass Estimation of Pinus densata Forests in Yunnan Using Landsat 8 Imagery by Incorporating Age Dummy Variable and Method Comparison
Optical remote sensing data have been widely used for estimating forest aboveground biomass (AGB). However, the use of optical images is often restricted by the saturation of spectral reflectance for forests that have multilayered and complex canopy structures and high AGB values and by the effect of spectral reflectance from underlayer shrub, grass, and bare soil for young stands. This usually leads to overestimations and underestimations for smaller and larger values, respectively, and makes it very challenging to improve the estimation accuracy of forest AGB. In this study, a novel methodology was proposed by incorporating stand age as a dummy variable into four models to improve the estimation accuracy of the Pinus densata forest AGB in Yunnan of Southwestern China. A total of eight models, including two parametric models (LM: linear regression model and LMC: LM with combined variables), two nonparametric models (RF: random forest and ANN: artificial neural network) without the age dummy variable, and four corresponding models with the age dummy variable (DLM, DLMC, DRF, and DANN), were compared to estimate AGB. Landsat 8 Operational Land Imager (OLI) images and 147 sample plots were acquired and utilized. The results showed that (1) compared with the two parametric models, the two nonparametric algorithms resulted in significantly greater estimation accuracies of Pinus densata forest AGB, and the increases of accuracy varied from 8% to 32% for 100 modeling plots and from 12% to 35% for 47 test plots based on root mean square error (RMSE); (2) compared with the models without the age dummy variable, the models with the age dummy variable greatly reduced the overestimations for the plots with AGB values smaller than 70 Mg/ha and the underestimations for the plots with AGB values larger than 180 Mg/ha and, thus, significantly improved the overall estimation accuracy by 14% to 42% for the modeling plots and by 32% to 44% for the test plots based on RMSE; and (3) the texture measures derived from the Landsat 8 OLI images contributed more to improving the estimation accuracy than the original spectral bands and other transformations. This implied that two nonparametric models, coupled with the use of the age dummy variable and texture measures, offered a great potential for improving the estimation accuracy of Pinus densata forest AGB.
USING OF TRIPLE-TREATED WASTEWATER IN AGRICULTURAL IRRIGATION IN AL-AHSA OASIS, SAUDI ARABIA
The agricultural sector consumes about 84% of the total actual water consumption annually in Kingdom of Saudi Arabia, therefore, the research depends on questionnaire data that was judged before being used in the study of a random sample of 355 farms from Al-Ahsa Oasis which irrigated with two types of water (groundwater, triple-treated wastewater) to measure the economic impact of the using of triple-treated wastewater in irrigating palm and lemon crops, the results indicated a statistically significant correlation between the extent of farmers'  which using of triple treated wastewater in irrigation and all of: size of the tenure, family size and the educational level of the farmer. While the relationship was negative with all of: age of the farmer, and number of years of his experience. The results showed that the productivity of dunums on farms irrigated with triple-treated wastewater increased by 0.32, 0.30, 0.22 tons, representing about 19.2%, 17.2%, 16.8% compared to which using groundwater for Alkhallas palm, Alrziz palm and lemon.
Even the rich can make themselves poor: a critical examination of IV methods in marketing applications
Marketing is a field that is rich in data. Our data is of high quality, often at a highly disaggregate level, and there is considerable variation in the key variables for which estimates of effects on outcomes such as sales and profits are desired. The recognition that, in some general sense, marketing variables are set by firms on the basis of information not always observable by the researcher has led to concerns regarding endogeneity and widespread pressure to implement instrumental variables methods in marketing problems. The instruments used in our empirical literature are rarely valid and the IV methods used can have poor sampling properties, including substantial finite sample bias and large sampling errors. Given the problems with IV methods, a convincing argument must be made that there is a first order endogeneity problem and that we have strong and valid instruments before these methods should be used. If strong and valid instruments are not available, then researchers need to look toward supplementing the information available to them. For example, if there are concerns about unobservable advertising or promotional variables, then the researcher is much better off measuring these variables rather than using instruments (such as lagged marketing variables) that are clearly invalid. Ultimately, only randomized variation in marketing variables (with proper implementation and large samples) can be argued to be a valid instrument without further assumptions.
The use of support vector machine, neural network, and regression analysis to predict and optimize surface roughness and cutting forces in milling
In the present study, prediction and optimization of the surface roughness and cutting forces in slot milling of aluminum alloy 7075-T6 were pursued by taking advantage of regression analysis, support vector regression (SVR), artificial neural network (ANN), and multi-objective genetic algorithm. The effects of process parameters, including cutting speed, feed per tooth, depth of cut, and tool type, on the responses were investigated by the analysis of variance (ANOVA). Grid search and cross-validation methods were used for hyperparameter tuning and to find the best ANN and SVR models. The training algorithm of developed NNs was one of the hyperparameters which was chosen from Levenberg-Marquardt and RMSprop algorithms. The performance of regression, SVR, and ANN models were compared with each other corresponding to each machining response studied. The ANN models were integrated with the non-dominated sorting genetic algorithm (NSGA-II) to find the optimum solutions by means of minimizing the surface roughness and cutting forces. In addition, the desirability function approach was utilized to select proper solutions from the statistical tools.
Exploration of Data Scene Characterization and 3D ROC Evaluation for Hyperspectral Anomaly Detection
Whether or not a hyperspectral anomaly detector is effective is determined by two crucial issues, anomaly detectability and background suppressibility (BS), both of which are very closely related to two factors, the datasets used for a selected hyperspectral anomaly detector and detection measures used for its performance evaluation. This paper explores how anomaly detectability and BS play key roles in hyperspectral anomaly detection (HAD). To address these two issues, we investigate three key elements attributed to HAD. One is a selected hyperspectral anomaly detector, and another is the datasets used for experiments. The third one is the detection measures used to evaluate the effectiveness of a hyperspectral anomaly detector. As for hyperspectral anomaly detectors, twelve commonly used anomaly detectors were evaluated and compared. To address the appropriate use of datasets for HAD, seven popular and widely used datasets were studied for HAD. As for the third issue, the traditional area under a receiver operating characteristic (ROC) curve of detection probability—PD versus false alarm probability, PF, (AUC(D,F))—was extended to 3D ROC analysis where a 3D ROC curve was developed to generate three 2D ROC curves from which eight detection measures could be derived to evaluate HAD in all round aspects, including anomaly detectability, BS and joint anomaly detectability and BS. Qualitative analysis showed that many works reported in the literature which claimed that their developed hyperspectral anomaly detectors performed better than other anomaly detectors are actually not true because they overlooked these two issues. Specifically, a comprehensive study via extensive experiments demonstrated that these 3D ROC curve-derived detection measures can be further used to address the various characterizations of different data scenes and also to provide explanations as to why certain data scenes are not suitable for HAD.
Developing a generalized nonlinear mixed-effects biomass model at stand-level under different age conditions for Chinese fir based on LiDAR and ground survey data in southern China
Chinese fir is a crucial afforestation and timber species in southern China. Accurate estimation of its stand biomass is vital for forest resource assessment, ecological industry development, and ecosystem management. However, traditional biomass prediction methods often face limitations in terms of accuracy and efficiency, highlighting the need for more robust modeling approaches. This study utilized data from 154 forest stands in Guangdong Province to develop biomass regression models that incorporate random effects and dummy variables. The models were based on airborne LiDAR-derived metrics. Among 41 highly correlated LiDAR variables, only two-5% cumulative height percentile and leaf area index-were retained in the final model. The results revealed that the logistic mixed-effects model was the most effective for estimating leaf biomass, while the empirical mixed-effects model was better suited for other biomass components. Nonlinear models outperformed linear models, with the nonlinear mixed-effects model (incorporating stand age as a random effect) achieving the highest predictive accuracy. Furthermore, machine learning techniques further improved model performance (R² = 0.855 to 0.939). Validation with independent test samples confirmed the robustness and reliability of the nonlinear mixed-effects model. This study highlights the effectiveness of airborne LiDAR data in providing efficient and precise estimates of stand biomass. It also emphasizes the significant role of stand developmental stages in biomass modeling. The findings contribute to the development of a rigorous and scalable framework for large-scale artificial forest biomass estimation, which has important implications for forest resource monitoring, ecological industry development, and ecosystem management strategies.
A Three-Level Model System of Biomass and Carbon Storage for All Forest Types in China
Forest biomass and carbon storage models are crucial for inventorying, monitoring, and assessing forest resources. This study develops models specific to China’s diverse forests, offering a methodological foundation for national carbon storage estimation and a quantitative basis for national, regional, and global carbon sequestration projections. Utilizing data from 52,700 permanent plots obtained during China’s 9th national forest inventory, we calculated biomass and carbon storage per hectare for 35 tree species groups using respective individual tree biomass models and carbon factors. We then constructed a three-level volume-based model system for forest biomass and carbon storage, applying weighted regression, dummy variable modeling, and simultaneous equations with error-in-variables. This system encompasses one population of forests, three forest categories (level I), 20 forest types (level II), and 74 forest sub-types (level III). Finally, the assessment of these models was carried out with six evaluation indices, and comparative analyses with previously established biomass models of three major forest types were conducted. Determination coefficients (R2) for the population average model, and three dummy models on levels I, II, and III, exceed 0.78, 0.85, 0.92, and 0.95, respectively, with corresponding mean prediction errors (MPEs) of 0.42%, 0.34%, 0.24%, and 0.19%, and mean percent standard errors (MPSEs) of approximately 22%, 21%, 15%, and 12%. Models for 20 forest types and 74 sub-types yield R2 values above 0.87 and 0.85, with MPE values below 3% and 5%, respectively. Notably, the estimates of previous biomass models of three major forest types demonstrated considerable uncertainty, with TRE ranging from −20% to 74%. However, accuracy has improved with larger sample sizes. In total biomass and carbon storage estimations, the R2 values of dummy models for levels I, II, and III progressively increase and MPSE and MPE values decrease, whereas TRE approximates zero. The tiered model system of simultaneous equations developed herein offers a quantitative framework for precise evaluations of biomass and carbon storage on different scales. For enhanced accuracy in such estimations, applying level III models is recommended whenever feasible, especially for national estimation.
Modeling and evaluating site and provenance variation in height–diameter relationships for Betula alnoides Buch.–Ham. ex D. Don in southern China
Tree height (H) and stem diameter at breast height (DBH) (H-D) relationship is correlated with timber yield and quality as well as stability of forest and is crucial in forest management and genetic breeding. It is influenced by not only environmental factors such as site quality and climate factors but also genetic control that is mostly neglected. A dataset of H and DBH of 25 provenances of Betula alnoides Buch.–Ham. ex D. Don at four sites was used to model the H-D relationship. The dummy variable nonliner mixed-effect equations were applied to evaluate the effects of sites and provenances on variations of the H-D relationship and to select superior provenances of B. alnoides . Weibull equation was selected as the base model for the H-D relationship. The sites affected asymptotes of the H-D curves, and the provenance effect on asymptotes of the H-D curves varied across sites. Taking above-average DBH and lower asymptote of the H-D curves as indicators, five excellent provenances were screened out at each site with a rate of 20%. Their selection gains of individual volume ranged from 1.99% to 29.81%, and their asymptote parameter ( k j ) and H-D ratio were 7.17%–486.05% and 3.07–4.72% lower than the relevant total means at four sites, respectively. Genetic selection based on the H-D relationship could promote selection efficiency of excellent germplasms and was beneficial for the large-sized timber production of B. alnoides .
Modelling individual tree height–diameter relationships for multi-layered and multi-species forests in central Europe
Key messageThe proposed height–diameter model applicable to many tree species in the multi-layered and mixed stands across Czech Republic shows a high accuracy in the height prediction. This model can be useful for estimating forest yield and biomass, and simulation of the vertical stand structures.We developed a generalized nonlinear mixed-effects height–diameter (H–D) model applicable to Norway spruce (Picea abies (L.) Karst.), European beech (Fagus sylvatica L.) and other conifer and broadleaved tree species using the modelling method that includes dummy variables accounting for species-specific height differences and random component accounting for within- and between-sample plot height differences and randomness in the data. We used two large datasets: the first set (model fitting dataset) originated from permanent research sample plots and second set (model-testing dataset) originated from the Czech national forest inventory (NFI) sample plots. The former dataset comprises 224 sample plots with 29,390 trees and the latter dataset comprises 14,903 sample plots with 382,540 trees, each representing wide variabilities of tree size, ecological zone, growth condition, stand structure and development stage, and management regime across the country. Among the four versatile growth functions evaluated as base functions with diameter at breast height (DBH) included as a single predictor, the Chapman-Richards function showed the most attractive fit statistics. This function was then extended through the integration of other predictor variables, which better describe the stand density (stand basal area), stand development and site quality (dominant height), competition (ratio of DBH to quadratic mean DBH), that would act as modifiers of the original parameters of the function. The mixed-effects H–D model described a large part of the variations in the H–D relationships (\\[R_{{{\\text{adj}}}^{2}\\] = 0.9182; RMSE = 2.7786) without substantial trends in the residuals. Testing this model against model-testing dataset confirmed the model’s high accuracy. The model may be used for estimating forest yield and biomass, and therefore will serve as an important tool for decision making in forestry.
Adjusting treatment effect estimates by post-stratification in randomized experiments
Experimenters often use post-stratification to adjust estimates. Post-stratification is akin to blocking, except that the number of treated units in each stratum is a random variable because stratification occurs after treatment assignment. We analyse both post-stratification and blocking under the Neyman—Rubin model and compare the efficiency of these designs. We derive the variances for a post-stratified estimator and a simple difference-in-means estimator under different randomization schemes. Post-stratification is nearly as efficient as blocking: the difference in their variances is of the order of 1/n 2 , with a constant depending on treatment proportion. Post-stratification is therefore a reasonable alternative to blocking when blocking is not feasible. However, in finite samples, post-stratification can increase variance if the number of strata is large and the strata are poorly chosen. To examine why the estimators' variances are different, we extend our results by conditioning on the observed number of treated units in each stratum. Conditioning also provides more accurate variance estimates because it takes into account how close (or far) a realized random sample is from a comparable blocked experiment. We then show that the practical substance of our results remains under an infinite population sampling model. Finally, we provide an analysis of an actual experiment to illustrate our analytical results.