Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
37 tokens/sec
GPT-4o
11 tokens/sec
Gemini 2.5 Pro Pro
37 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
10 tokens/sec
DeepSeek R1 via Azure Pro
33 tokens/sec
2000 character limit reached

'Intelligence Studies Network': A human-curated database for indexing resources with open-source tools (2408.03868v1)

Published 7 Aug 2024 in cs.DL and cs.DB

Abstract: The Intelligence Studies Network is a comprehensive resource database for publications, events, conferences, and calls for papers in the field of intelligence studies. It offers a novel solution for monitoring, indexing, and visualising resources. Sources are automatically monitored and added to a manually curated database, ensuring the relevance of items to intelligence studies. Curated outputs are stored in a group library on Zotero, an open-source reference management tool. The metadata of items in Zotero is enriched with OpenAlex, an open access bibliographic database. Finally, outputs are listed and visualised on a Streamlit app, an open-source Python framework for building apps. This paper aims to explain the Intelligence Studies Network database and provide a detailed guide on data sources and the workflow. This study demonstrates that it is possible to create a specialised academic database by using open source tools.

Summary

  • The paper introduces a curated database that indexes specialized resources in intelligence studies.
  • It employs open-source tools like Zotero and OpenAlex combined with manual curation to overcome automated filtering challenges.
  • By enhancing research efficiency and offering trend analytics, the approach sets a precedent for future academic databases.

Overview of the Intelligence Studies Network Database

The paper, "Intelligence Studies Network: A Human-Curated Database for Monitoring and Indexing Resources on Intelligence Studies" by Yusuf A. Ozkan, presents an insightful discourse on the development and functionality of the Intelligence Studies Network (ISN). This database offers an exhaustive repository of publications, events, conferences, and calls for papers specifically within the domain of intelligence studies. It addresses critical challenges inherent in automating data collection and offers an innovative solution by leveraging open-source tools.

Data Sources and Collection Methods

The paper explicates the intricate process of data collection for the ISN database. Given the multifaceted nature of the term "intelligence," the automated collection poses substantial difficulties. Terms like "intelligence" are employed across diverse disciplines, from artificial intelligence to cognitive intelligence, leading to the inclusion of irrelevant sources. Consequently, the ISN emphasizes manual curation to ensure content relevance.

Key Methodologies:

  • OpenAlex API: Utilized for retrieving journal articles based on journal names and specific keywords. This choice is instrumental due to OpenAlex's extensive coverage and significant API call limits.
  • Flagship Journals: Articles from prominent journals in intelligence studies are included comprehensively. For other journals, keywords in titles help filter pertinent articles.
  • Manual Monitoring: For books and grey literature, manual monitoring using Google Scholar alerts and other sources ensures a broad coverage.

Structure and Interface

The primary database for references is Zotero, chosen for its capability to manage complex publication metadata and its open-source nature. Zotero's group libraries facilitate shared management and usage. The enriched dataset, including citation and open access information from OpenAlex, is stored in a GitHub repository.

Website Interface:

  • Home Page: Provides key metrics, search functionalities, and an overview of recent additions, citations, and publications.
  • Collections: Organized thematically to offer targeted searches within specific sub-fields of intelligence studies. It comprises 12 collections, ranging from historical intelligence to covert actions and global intelligence.
  • Events and Digest: Lists relevant events, conferences, and calls for papers, updated manually to maintain accuracy. The Digest section consolidates recent updates.
  • Institutions: A directory of institutions, academic programs, and other resources related to intelligence studies.

Limitations

The paper acknowledges several limitations:

  1. Manual Curation Dependency: Despite advantages in relevance, manual updating may lead to omissions and subjective biases.
  2. Incomplete Coverage: Especially for grey literature and non-English sources, comprehensive inclusion remains challenging.
  3. Citation and Open Access Limitations: These data points may not be available for all items due to the dependency on OpenAlex.

Implications and Future Directions

The ISN represents a significant contribution to the intelligence studies field, offering a comprehensive database that addresses the unique challenges of sourcing relevant publications. By leveraging open-source tools like Zotero, OpenAlex, and Streamlit, this project exemplifies the potential of combining various open-source platforms to create specialized academic databases.

Practical Implications:

  • Enhanced Research Efficiency: Researchers save time by accessing a curated repository, eliminating irrelevant sources.
  • Visual Analytics: The dashboards and metrics facilitate trend analysis and scholarly insights.

Theoretical Implications:

  • Proof of Concept: Demonstrates the feasibility of specialized databases using open-source frameworks.
  • Future Research: Opens avenues for similar databases in other academic fields, promoting open access and community-driven data curation.

Looking forward, potential developments may focus on improving the automation of data collection, enhancing multi-language support, and expanding grey literature inclusion. By continually refining the database, the ISN can better serve the intelligence academic community and foster a deeper understanding of the field.

In conclusion, the development of the Intelligence Studies Network is a commendable effort towards creating a focused and efficient resource for researchers. It sets a valuable precedent for future projects aiming to harness the potential of open-source tools for specialized academic databases.

X Twitter Logo Streamline Icon: https://streamlinehq.com