PERPUSTAKAAN BIG

  • Beranda
  • Informasi
  • Berita
  • Bantuan
  • Area Pustakawan
  • Area Anggota
  • Pilih Bahasa :
    Bahasa Arab Bahasa Bengal Bahasa Brazil Portugis Bahasa Inggris Bahasa Spanyol Bahasa Jerman Bahasa Indonesia Bahasa Jepang Bahasa Melayu Bahasa Persia Bahasa Rusia Bahasa Thailand Bahasa Turki Bahasa Urdu
Image of Streamlining geoscience data analysis with an LLM-driven workflow

Text

Streamlining geoscience data analysis with an LLM-driven workflow

Jiyin Zhang - Nama Orang; Xiaogang Ma - Nama Orang; Xiang Que - Nama Orang; Wenjia Li - Nama Orang; Weilin Chen - Nama Orang; Chenhao Li - Nama Orang; Cory Clairmont - Nama Orang;

Large Language Models (LLMs) have made significant advancements in natural language processing and human-like response generation. However, training and fine-tuning an LLM to fit the strict requirements in the scope of academic research, such as geoscience, still requires significant computational resources and human expert alignment to ensure the quality and reliability of the generated content. The challenges highlight the need for a more flexible and reliable LLM workflow to meet domain-specific analysis needs. This study proposes an LLM-driven workflow that addresses the challenges of utilizing LLMs in geoscience data analysis. The work was built upon the open data API (application programming interface) of Mindat, one of the largest databases in mineralogy. We designed and developed an open-source LLM-driven workflow that processes natural language requests and automatically utilizes the Mindat API, mineral co-occurrence network analysis, and locality distribution heat map visualization to conduct geoscience data analysis tasks. Using prompt engineering techniques, we developed a supervisor-based agentic framework that enables LLM agents to not only interpret context information but also autonomously addressing complex geoscience analysis tasks, bridging the gap between automated workflows and human expertise. This agentic design emphasizes autonomy, allowing the workflow to adapt seamlessly to future advancements in LLM capabilities without requiring additional fine-tuning or domain-specific embedding. By providing the comprehensive context of the task in the workflow and the professional tool, we ensure the quality of LLM-generated content without the need to embed geoscience knowledge into LLMs through fine-tuning or human alignment. Our approach integrates LLMs into geoscience data analysis, addressing the need for specialized tools while reducing the learning curve through LLM-driven interactions between users and APIs. This streamlined workflow enhances the efficiency of exploratory data analysis, as demonstrated by the several use cases presented. In our future work we will explore the scalability of this workflow through the integration of additional agents and diverse geoscience data sources.


Ketersediaan
244551.136Perpustakaan BIG (Eksternal Harddisk)Tersedia
Informasi Detail
Judul Seri
Applied Computing and Geoscience - Open Access
No. Panggil
551.136
Penerbit
Amsterdam : Elsevier., 2025
Deskripsi Fisik
10 hlm PDF, 6.546 KB
Bahasa
Inggris
ISBN/ISSN
2590-1974
Klasifikasi
551.136
Tipe Isi
text
Tipe Media
-
Tipe Pembawa
-
Edisi
Vol.25, February 2025
Subjek
Large language model
AI agent
Prompt engineering
Geoscience data analysis
Mindat
Info Detail Spesifik
-
Pernyataan Tanggungjawab
-
Versi lain/terkait

Tidak tersedia versi lain

Lampiran Berkas
  • Streamlining geoscience data analysis with an LLM-driven workflow
    Large Language Models (LLMs) have made significant advancements in natural language processing and human-like response generation. However, training and fine-tuning an LLM to fit the strict requirements in the scope of academic research, such as geoscience, still requires significant computational resources and human expert alignment to ensure the quality and reliability of the generated content. The challenges highlight the need for a more flexible and reliable LLM workflow to meet domain-specific analysis needs. This study proposes an LLM-driven workflow that addresses the challenges of utilizing LLMs in geoscience data analysis. The work was built upon the open data API (application programming interface) of Mindat, one of the largest databases in mineralogy. We designed and developed an open-source LLM-driven workflow that processes natural language requests and automatically utilizes the Mindat API, mineral co-occurrence network analysis, and locality distribution heat map visualization to conduct geoscience data analysis tasks. Using prompt engineering techniques, we developed a supervisor-based agentic framework that enables LLM agents to not only interpret context information but also autonomously addressing complex geoscience analysis tasks, bridging the gap between automated workflows and human expertise. This agentic design emphasizes autonomy, allowing the workflow to adapt seamlessly to future advancements in LLM capabilities without requiring additional fine-tuning or domain-specific embedding. By providing the comprehensive context of the task in the workflow and the professional tool, we ensure the quality of LLM-generated content without the need to embed geoscience knowledge into LLMs through fine-tuning or human alignment. Our approach integrates LLMs into geoscience data analysis, addressing the need for specialized tools while reducing the learning curve through LLM-driven interactions between users and APIs. This streamlined workflow enhances the efficiency of exploratory data analysis, as demonstrated by the several use cases presented. In our future work we will explore the scalability of this workflow through the integration of additional agents and diverse geoscience data sources.
    Other Resource Link
Komentar

Anda harus masuk sebelum memberikan komentar

PERPUSTAKAAN BIG
  • Informasi
  • Layanan
  • Pustakawan
  • Area Anggota

Tentang Kami

Perpustakaan Badan Informasi Geospasial (BIG) adalah sebuah perpustakaan yang berada di bawah Badan Informasi Geospasial Indonesia. Perpustakaan ini memiliki koleksi yang berkaitan dengan informasi geospasial, termasuk peta, data geospasial, dan literatur terkait. Selengkapnya

Cari

masukkan satu atau lebih kata kunci dari judul, pengarang, atau subjek

Donasi untuk SLiMS Kontribusi untuk SLiMS?

© 2025 — Senayan Developer Community

Ditenagai oleh SLiMS
Pilih subjek yang menarik bagi Anda
  • Batas Wilayah
  • Ekologi
  • Fotogrametri
  • Geografi
  • Geologi
  • GIS
  • Ilmu Tanah
  • Kartografi
  • Manajemen Bencana
  • Oceanografi
  • Penginderaan Jauh
  • Peta
Icons made by Freepik from www.flaticon.com
Pencarian Spesifik