
IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v130y2025i2d10.1007_s11192-025-05233-1.html

GAN-CITE: leveraging semi-supervised generative adversarial networks for citation function classification with limited data

Author

Listed:
  • Krittin Chatrinan (Mahidol University)
  • Thanapon Noraset (Mahidol University)
  • Suppawong Tuarob (Mahidol University)

Abstract
Citation function analysis is crucial to understanding how cited literature contributes to the overall discourse and meaning conveyed in scientific publications. Citation functions serve diverse roles that must be accurately identified and categorized. Still, the field of citation function analysis faces challenges due to limited labeled data and the complexity of defining and categorizing citation functions, which require expertise and a deep understanding of scientific literature. This limitation results in imprecise identification and categorization of citation functions, emphasizing the need for further advancements to improve the accuracy and reliability of citation function analysis. This paper proposes GAN-CITE, a novel framework employing semi-supervised learning techniques to address these limitations. Its primary objective is to efficiently leverage available unlabeled data by combining generative adversarial networks (GANs) and the language model to incorporate substantial data representations from unlabeled data sources. Our study demonstrates that GAN-CITE outperforms both supervised and semi-supervised state-of-the-art models in limited data settings, namely 10%, 20%, and 30% of the total labeled data. We also examine its performance in insufficient and imbalanced labeled data situations, as well as the potential of unlabeled data utilization. These findings highlight the success of generative adversarial networks in enhancing citation function classification and their applications in digital libraries that require precise citation function categorization, such as trend analysis and impact quantification, under limited annotated data.
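The limited-data settings mentioned in the abstract (10%, 20%, and 30% of the labeled data) can be illustrated with a minimal sketch. This is not the authors' procedure: the per-class (stratified) sampling, the function name `split_labeled_unlabeled`, and the toy labels `background` and `method` are all assumptions made for illustration. The sketch keeps labels for a fraction of each class and treats the remainder as the unlabeled pool that a semi-supervised model could draw on.

```python
import random
from collections import defaultdict


def split_labeled_unlabeled(examples, fraction, seed=0):
    """Keep labels for `fraction` of each class; return the rest as unlabeled text.

    Hypothetical helper: per-class sampling is an assumption, not a detail
    taken from the paper.
    """
    rng = random.Random(seed)
    by_label = defaultdict(list)
    for text, label in examples:
        by_label[label].append(text)

    labeled, unlabeled = [], []
    for label, texts in sorted(by_label.items()):
        texts = texts[:]          # copy so the caller's data is not shuffled
        rng.shuffle(texts)
        k = max(1, round(len(texts) * fraction))  # keep at least one per class
        labeled += [(t, label) for t in texts[:k]]
        unlabeled += texts[k:]    # labels are dropped for the unlabeled pool
    return labeled, unlabeled


# Toy citation contexts with made-up labels (illustrative only).
toy = [(f"context {i}", "background") for i in range(20)]
toy += [(f"context {i}", "method") for i in range(20, 30)]

splits = {f: split_labeled_unlabeled(toy, f) for f in (0.1, 0.2, 0.3)}
for f, (lab, unlab) in splits.items():
    print(f, len(lab), len(unlab))
```

Sampling per class keeps rare citation functions represented even at the smallest fraction; a plain random sample would be an equally valid assumption for a sketch like this.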

Suggested Citation

  • Krittin Chatrinan & Thanapon Noraset & Suppawong Tuarob, 2025. "GAN-CITE: leveraging semi-supervised generative adversarial networks for citation function classification with limited data," Scientometrics, Springer;Akadémiai Kiadó, vol. 130(2), pages 679-703, February.
  • Handle: RePEc:spr:scient:v:130:y:2025:i:2:d:10.1007_s11192-025-05233-1
    DOI: 10.1007/s11192-025-05233-1

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-025-05233-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-025-05233-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    1. Xin An & Xin Sun & Shuo Xu, 2022. "Important citations identification with semi-supervised classification model," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6533-6555, November.
    2. Ruihua Qi & Jia Wei & Zhen Shao & Zhengguang Li & Heng Chen & Yunhao Sun & Shaohua Li, 2023. "Multi-task learning model for citation intent classification in scientific publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(12), pages 6335-6355, December.
    3. Setio Basuki & Masatoshi Tsuchiya, 2022. "SDCF: semi-automatically structured dataset of citation functions," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(8), pages 4569-4608, August.
    4. Chanathip Pornprasit & Xin Liu & Pattararat Kiattipadungkul & Natthawut Kertkeidkachorn & Kyoung-Sook Kim & Thanapon Noraset & Saeed-Ul Hassan & Suppawong Tuarob, 2022. "Enhancing citation recommendation using citation network embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 233-264, January.
    5. Ma, Shutian & Zhang, Chengzhi & Zhang, Heng & Gao, Zheng, 2025. "Citation recommendation based on argumentative zoning of user queries," Journal of Informetrics, Elsevier, vol. 19(1).
    6. Milena Vuletić & Felix Prenzel & Mihai Cucuringu, 2024. "Fin-GAN: forecasting and classifying financial time series via generative adversarial networks," Quantitative Finance, Taylor & Francis Journals, vol. 24(2), pages 175-199, January.
    7. Yang Zhang & Rongying Zhao & Yufei Wang & Haihua Chen & Adnan Mahmood & Munazza Zaib & Wei Emma Zhang & Quan Z. Sheng, 2022. "Correction to: Towards employing native information in citation function classification," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6579-6579, November.
    8. Yang Zhang & Rongying Zhao & Yufei Wang & Haihua Chen & Adnan Mahmood & Munazza Zaib & Wei Emma Zhang & Quan Z. Sheng, 2022. "Towards employing native information in citation function classification," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6557-6577, November.
    9. Xiaorui Jiang & Jingqiang Chen, 2023. "Contextualised segment-wise citation function classification," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(9), pages 5117-5158, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ruihua Qi & Jia Wei & Zhen Shao & Zhengguang Li & Heng Chen & Yunhao Sun & Shaohua Li, 2023. "Multi-task learning model for citation intent classification in scientific publications," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(12), pages 6335-6355, December.
    2. Yi Zhang & Chengzhi Zhang & Philipp Mayr & Arho Suominen, 2022. "An editorial of “AI + informetrics”: multi-disciplinary interactions in the era of big data," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6503-6507, November.
    3. Percia David, Dimitri & Maréchal, Loïc & Lacube, William & Gillard, Sébastien & Tsesmelis, Michael & Maillart, Thomas & Mermoud, Alain, 2023. "Measuring security development in information technologies: A scientometric framework using arXiv e-prints," Technological Forecasting and Social Change, Elsevier, vol. 188(C).
    4. Indra Budi & Yaniasih Yaniasih, 2023. "Understanding the meanings of citations using sentiment, role, and citation function classifications," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(1), pages 735-759, January.
    5. Li, Xin & Tang, Xuli & Lu, Wei, 2024. "Investigating clinical links in edge-labeled citation networks of biomedical research: A translational science perspective," Journal of Informetrics, Elsevier, vol. 18(3).
    6. Xiaorui Jiang & Jingqiang Chen, 2023. "Contextualised segment-wise citation function classification," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(9), pages 5117-5158, September.
    7. Faiza Qayyum & Harun Jamil & Naeem Iqbal & DoHyeun Kim & Muhammad Tanvir Afzal, 2022. "Toward potential hybrid features evaluation using MLP-ANN binary classification model to tackle meaningful citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6471-6499, November.
    8. Guo Chen & Jing Chen & Yu Shao & Lu Xiao, 2023. "Automatic noise reduction of domain-specific bibliographic datasets using positive-unlabeled learning," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(2), pages 1187-1204, February.
    9. Nikolas Michael & Mihai Cucuringu & Sam Howison, 2024. "A GCN-LSTM Approach for ES-mini and VX Futures Forecasting," Papers 2408.05659, arXiv.org.
    10. Wei Cheng & Dejun Zheng & Shaoxiong Fu & Jingfeng Cui, 2024. "Closer in time and higher correlation: disclosing the relationship between citation similarity and citation interval," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 4495-4512, July.
    11. Jiaying Liu & Jun Zhang, 2025. "Publication recommendation in incomplete networks based on graph learning," Scientometrics, Springer;Akadémiai Kiadó, vol. 130(2), pages 565-591, February.
    12. Chien-chih Huang & Kuang-hua Chen, 2024. "RefCit2vec: embedding models considering references and citations for measuring document similarity," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(8), pages 4669-4693, August.
    13. Xiaojuan Zhang & Shuqi Song & Yuping Xiong, 2024. "Personalized global citation recommendation with diversification awareness," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 3625-3657, July.
    14. Howard Caulfield & James P. Gleeson, 2024. "Systematic comparison of deep generative models applied to multivariate financial time series," Papers 2412.06417, arXiv.org.
    15. Bokai Cao & Saizhuo Wang & Xinyi Lin & Xiaojun Wu & Haohan Zhang & Lionel M. Ni & Jian Guo, 2025. "From Deep Learning to LLMs: A survey of AI in Quantitative Investment," Papers 2503.21422, arXiv.org.
    16. Nimbeshaho Thierry & Bing-Kun Bao & Zafar Ali, 2023. "RAR-SB: research article recommendation using SciBERT with BiGRU," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(12), pages 6427-6448, December.
    17. Yang, Alex Jie & Wu, Linwei & Zhang, Qi & Wang, Hao & Deng, Sanhong, 2023. "The k-step h-index in citation networks at the paper, author, and institution levels," Journal of Informetrics, Elsevier, vol. 17(4).
    18. Zhenye Huang & Deyou Tang & Rong Zhao & Wenjing Rao, 2024. "A scientific paper recommendation method using the time decay heterogeneous graph," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(3), pages 1589-1613, March.
    19. Yonghe Lu & Meilu Yuan & Jiaxin Liu & Minghong Chen, 2023. "Research on semantic representation and citation recommendation of scientific papers with multiple semantics fusion," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(2), pages 1367-1393, February.
    20. Orzechowski, Kamil P. & Mrowinski, Maciej J. & Fronczak, Agata & Fronczak, Piotr, 2023. "Asymmetry of social interactions and its role in link predictability: The case of coauthorship networks," Journal of Informetrics, Elsevier, vol. 17(2).
