This study aims to establish a scientific and methodological basis for predicting shoreline positions using modern data analysis and machine learning techniques. The focus area is a 5 km section of the Ural coast along Baydaratskaya Bay in the Kara Sea. This region was selected due to its diverse geomorphological features, varied lithological composition, and significant presence of permafrost …
This study investigates the application of Compositional Data Analysis (CoDA) and multivariate statistical techniques to geochemical data from the soils of the Campania region. The dataset examined includes 3571 soil samples analyzed for 37 chemical elements. Principal Component Analysis (PCA) was employed to reduce the dataset’s dimensionality and identify key relationships between elements.…
The connectivity of sandbodies is a key constraint to the exploration effectiveness of Bohai A Oilfield. Conventional connectivity studies often use methods such as seismic attribute fusion, while the development of contiguous composite sandbodies in this area makes it challenging to characterize connectivity changes with conventional seismic attributes. Aiming at the above problem in the Bohai…
Large Language Models (LLMs) have made significant advancements in natural language processing and human-like response generation. However, training and fine-tuning an LLM to fit the strict requirements in the scope of academic research, such as geoscience, still requires significant computational resources and human expert alignment to ensure the quality and reliability of the generated conten…
Geochemical data are compositional in nature and are subject to the problems typically associated with data that are restricted to the real non-negative number space with constant-sum constraint, that is, the simplex. Geochemistry can be considered a proxy for mineralogy, comprised of atomically ordered structures that define the placement and abundance of elements in the mineral lattice struct…
This paper presents an enhanced 3D heat map for exploratory data analysis (EDA) of open mineral data, addressing the challenges caused by rapidly evolving datasets and ensuring scientifically meaningful data exploration. The Mindat website, a crowd-sourced database of mineral species, provides a constantly updated open data source via its newly established application programming interface (API…
Since the advent of modern computing, geochemists have increasingly relied on computers to garner efficiencies in calculations, data analysis, and data presentation. Entirely new fields, such as Monte Carlo-based simulation and geochemical modeling, have developed under this paradigm. With continued growth in computing power, machine learning has become an increasingly popular tool in aqueous g…