Recommendation

Informative description of a project implementing a CIDOC-CRM based native graph database for representing megalithic information

Isto Huvila based on reviews by 2 anonymous reviewers

A recommendation of:

Transforming the CIDOC-CRM model into a megalithic monument property graph

Ariele Câmara, Ana de Almeida, João Oliveira (2024), Zenodo, ver.4, peer-reviewed and recommended by PCI Archaeology https://doi.org/10.5281/zenodo.7981230

Read preprint in preprint server Now published in a journal

Abstract

EN

AR

ES

FR

HI

JA

PT

RU

ZH-CN

Transforming the CIDOC-CRM model into a megalithic monument property graph

This paper presents a method to store information about megalithic monuments' building components as graph nodes in a knowledge graph (KG). As a case study we analyse the dolmens from the region of Pavia (Portugal). To build the KG, information has been extracted from unstructured data to populate a schema model based on the International Committee for Documentation - Conceptual Reference Model (CIDOC-CRM). In order to prepare the archaeological monument's information for bulk loading, it was transformed into semi-structured data. While the semi-structured file was used to populate the classes with their respective properties and instances, the KG labels and types were defined using some of the entities and relations defined by the CIDOC-CRM. The knowledge-driven model was built to represent dolmens in a formal and structured manner using Neo4J, a property-graph database. Modeling a labeled property graph based on predefined labels as a KG enables to transform textual semantic data into instances and properties. Thus, we show that it is possible to represent at a granular level all the information about the structural components of monuments since heterogeneities, granularities, and large amounts of data can be handled by a KG. Therefore, a KG implemented using a native graph database can improve data storage and processing, making it interoperable either between humans, between humans and machines and machine-to-machine.

knowledge graph, dolmen, CIDOC-CRM, labeled property graph, Neo4J

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

تحويل نموذج CIDOC-CRM إلى رسم بياني لخصائص النصب الصخري

تقدم هذه الورقة طريقة لتخزين المعلومات حول مكونات بناء الآثار الصخرية كعقد بيانية في الرسم البياني المعرفي (KG). كدراسة حالة قمنا بتحليل الدولمينات من منطقة بافيا (البرتغال). لبناء رياض الأطفال، تم استخراج المعلومات من البيانات غير المنظمة لملء نموذج مخطط يعتمد على اللجنة الدولية للتوثيق - النموذج المرجعي المفاهيمي (CIDOC-CRM). ومن أجل إعداد معلومات النصب الأثري للتحميل بالجملة، تم تحويلها إلى بيانات شبه منظمة. بينما تم استخدام الملف شبه المنظم لملء الفئات بخصائصها ومثيلاتها، تم تحديد تسميات وأنواع KG باستخدام بعض الكيانات والعلاقات المحددة بواسطة CIDOC-CRM. تم بناء النموذج القائم على المعرفة لتمثيل الدولمينات بطريقة رسمية ومنظمة باستخدام Neo4J، وهي قاعدة بيانات الرسم البياني للخصائص. إن نمذجة الرسم البياني للخصائص المسمى بناءً على التسميات المحددة مسبقًا في مرحلة الروضة تمكن من تحويل البيانات الدلالية النصية إلى مثيلات وخصائص. وبالتالي، نوضح أنه من الممكن تمثيل جميع المعلومات حول المكونات الهيكلية للآثار على المستوى الجزئي، حيث يمكن التعامل مع عدم التجانس والتفاصيل والكميات الكبيرة من البيانات بواسطة KG. لذلك، يمكن لرياض الأطفال التي يتم تنفيذها باستخدام قاعدة بيانات الرسم البياني الأصلية تحسين تخزين البيانات ومعالجتها، مما يجعلها قابلة للتشغيل المتبادل إما بين البشر، أو بين البشر والآلات، أو من آلة إلى آلة.

الرسم البياني للمعرفة، الدولمين، CIDOC-CRM، الرسم البياني للملكية المسمى، Neo4J

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Transformando el modelo CIDOC-CRM en un gráfico de propiedades de monumentos megalíticos

Este artículo presenta un método para almacenar información sobre los componentes de construcción de monumentos megalíticos como nodos gráficos en un gráfico de conocimiento (KG). Como caso de estudio analizamos los dólmenes de la región de Pavía (Portugal). Para construir el KG, se extrajo información de datos no estructurados para completar un modelo de esquema basado en el Comité Internacional de Documentación - Modelo de Referencia Conceptual (CIDOC-CRM). Para preparar la información del monumento arqueológico para la carga masiva, se transformó en datos semiestructurados. Mientras que el archivo semiestructurado se utilizó para completar las clases con sus respectivas propiedades e instancias, las etiquetas y tipos de KG se definieron utilizando algunas de las entidades y relaciones definidas por CIDOC-CRM. El modelo basado en conocimiento se construyó para representar dólmenes de manera formal y estructurada utilizando Neo4J, una base de datos de gráficos de propiedades. Modelar un gráfico de propiedades etiquetado basado en etiquetas predefinidas como un KG permite transformar datos semánticos textuales en instancias y propiedades. Por lo tanto, mostramos que es posible representar a nivel granular toda la información sobre los componentes estructurales de los monumentos, ya que un KG puede manejar heterogeneidades, granularidades y grandes cantidades de datos. Por lo tanto, un KG implementado utilizando una base de datos gráfica nativa puede mejorar el almacenamiento y procesamiento de datos, haciéndolo interoperable ya sea entre humanos, entre humanos y máquinas y de máquina a máquina.

gráfico de conocimiento, dolmen, CIDOC-CRM, gráfico de propiedades etiquetadas, Neo4J

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Transformer le modèle CIDOC-CRM en graphe de propriétés de monuments mégalithiques

Cet article présente une méthode pour stocker des informations sur les composants de construction des monuments mégalithiques sous forme de nœuds de graphe dans un graphe de connaissances (KG). À titre d'étude de cas, nous analysons les dolmens de la région de Pavie (Portugal). Pour construire le KG, des informations ont été extraites de données non structurées pour alimenter un modèle de schéma basé sur le Comité international pour la documentation - Modèle de référence conceptuel (CIDOC-CRM). Afin de préparer les informations du monument archéologique au chargement en masse, celles-ci ont été transformées en données semi-structurées. Alors que le fichier semi-structuré était utilisé pour remplir les classes avec leurs propriétés et instances respectives, les étiquettes et types KG ont été définis à l'aide de certaines des entités et relations définies par le CIDOC-CRM. Le modèle basé sur la connaissance a été construit pour représenter les dolmens de manière formelle et structurée à l'aide de Neo4J, une base de données de graphes de propriétés. La modélisation d'un graphe de propriétés étiquetées basé sur des étiquettes prédéfinies en tant que KG permet de transformer des données sémantiques textuelles en instances et propriétés. Ainsi, nous montrons qu'il est possible de représenter à un niveau granulaire toutes les informations sur les composants structurels des monuments puisque les hétérogénéités, les granularités et les grandes quantités de données peuvent être gérées par un KG. Par conséquent, une KG implémentée à l'aide d'une base de données graphique native peut améliorer le stockage et le traitement des données, la rendant interopérable soit entre humains, entre humains et machines et de machine à machine.

graphe de connaissances, dolmen, CIDOC-CRM, graphe de propriétés étiquetées, Neo4J

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

CIDOC-CRM मॉडल को मेगालिथिक स्मारक संपत्ति ग्राफ में बदलना

यह पेपर मेगालिथिक स्मारकों के निर्माण घटकों के बारे में जानकारी को ज्ञान ग्राफ (केजी) में ग्राफ नोड्स के रूप में संग्रहीत करने की एक विधि प्रस्तुत करता है। एक केस अध्ययन के रूप में हम पाविया (पुर्तगाल) क्षेत्र से डोलमेन्स का विश्लेषण करते हैं। केजी के निर्माण के लिए, अंतर्राष्ट्रीय दस्तावेज़ीकरण समिति - संकल्पनात्मक संदर्भ मॉडल (सीआईडीओसी-सीआरएम) के आधार पर एक स्कीमा मॉडल तैयार करने के लिए असंरचित डेटा से जानकारी निकाली गई है। पुरातात्विक स्मारक की जानकारी को थोक लोडिंग के लिए तैयार करने के लिए, इसे अर्ध-संरचित डेटा में बदल दिया गया था। जबकि अर्ध-संरचित फ़ाइल का उपयोग कक्षाओं को उनके संबंधित गुणों और उदाहरणों से भरने के लिए किया गया था, KG लेबल और प्रकारों को CIDOC-CRM द्वारा परिभाषित कुछ संस्थाओं और संबंधों का उपयोग करके परिभाषित किया गया था। ज्ञान-संचालित मॉडल एक संपत्ति-ग्राफ़ डेटाबेस, Neo4J का उपयोग करके औपचारिक और संरचित तरीके से डोलमेंस का प्रतिनिधित्व करने के लिए बनाया गया था। केजी के रूप में पूर्वनिर्धारित लेबल के आधार पर एक लेबल प्रॉपर्टी ग्राफ़ को मॉडलिंग करने से टेक्स्ट सिमेंटिक डेटा को उदाहरणों और गुणों में बदलने में सक्षम बनाता है। इस प्रकार, हम दिखाते हैं कि स्मारकों के संरचनात्मक घटकों के बारे में सभी जानकारी को एक विस्तृत स्तर पर प्रस्तुत करना संभव है क्योंकि विविधता, ग्रैन्युलैरिटी और बड़ी मात्रा में डेटा को केजी द्वारा नियंत्रित किया जा सकता है। इसलिए, एक देशी ग्राफ़ डेटाबेस का उपयोग करके कार्यान्वित केजी डेटा भंडारण और प्रसंस्करण में सुधार कर सकता है, जिससे यह मनुष्यों के बीच, मनुष्यों और मशीनों के बीच और मशीन-टू-मशीन के बीच अंतर-संचालित हो सकता है।

ज्ञान ग्राफ़, डोलमेन, CIDOC-CRM, लेबल प्रॉपर्टी ग्राफ़, Neo4J

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

CIDOC-CRM モデルを巨石記念碑の特性グラフに変換する

この論文では、巨石記念碑の建築コンポーネントに関する情報をナレッジグラフ (KG) のグラフノードとして保存する方法を紹介します。ケーススタディとして、私たちはパヴィア（ポルトガル）地域のドルメンを分析します。 KG を構築するために、非構造化データから情報が抽出され、国際文書化委員会 - 概念参照モデル (CIDOC-CRM) に基づいてスキーマモデルが設定されています。考古学的記念碑の情報を一括読み込みできるように準備するために、情報は半構造化データに変換されました。半構造化ファイルは、クラスにそれぞれのプロパティとインスタンスを設定するために使用されましたが、KG ラベルとタイプは、CIDOC-CRM によって定義されたエンティティと関係の一部を使用して定義されました。知識主導型モデルは、特性グラフデータベースである Neo4J を使用して、正式かつ構造化された方法でドルメンを表現するために構築されました。事前定義されたラベルに基づいてラベル付きプロパティグラフを KG としてモデル化すると、テキストのセマンティックデータをインスタンスとプロパティに変換できます。したがって、異質性、粒度、および大量のデータを KG で処理できるため、記念碑の構造コンポーネントに関するすべての情報を粒度レベルで表現できることを示します。したがって、ネイティブグラフデータベースを使用して実装された KG は、データの保存と処理を改善し、人間間、人間とマシン間、マシン間の相互運用性を実現します。

ナレッジグラフ、ドルメン、CIDOC-CRM、ラベル付きプロパティグラフ、Neo4J

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Transformando o modelo CIDOC-CRM em um gráfico de propriedades de monumentos megalíticos

Este artigo apresenta um método para armazenar informações sobre componentes de construção de monumentos megalíticos como nós de grafos em um grafo de conhecimento (KG). Como estudo de caso analisamos as antas da região de Pavia (Portugal). Para construir o KG, foram extraídas informações de dados não estruturados para preencher um modelo de esquema baseado no Comitê Internacional de Documentação - Modelo de Referência Conceitual (CIDOC-CRM). Para preparar a informação do monumento arqueológico para carregamento em massa, esta foi transformada em dados semiestruturados. Enquanto o arquivo semiestruturado foi utilizado para preencher as classes com suas respectivas propriedades e instâncias, os rótulos e tipos do KG foram definidos utilizando algumas das entidades e relações definidas pelo CIDOC-CRM. O modelo orientado ao conhecimento foi construído para representar dólmens de maneira formal e estruturada usando Neo4J, um banco de dados de gráficos de propriedades. Modelar um gráfico de propriedades rotuladas com base em rótulos predefinidos como um KG permite transformar dados semânticos textuais em instâncias e propriedades. Assim, mostramos que é possível representar de forma granular toda a informação sobre os componentes estruturais dos monumentos, uma vez que heterogeneidades, granularidades e grandes quantidades de dados podem ser tratadas por um KG. Portanto, um KG implementado usando um banco de dados gráfico nativo pode melhorar o armazenamento e processamento de dados, tornando-o interoperável entre humanos, entre humanos e máquinas e máquina a máquina.

gráfico de conhecimento, dolmen, CIDOC-CRM, gráfico de propriedades rotuladas, Neo4J

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Преобразование модели CIDOC-CRM в граф свойств мегалитического памятника

В данной статье представлен метод хранения информации о строительных компонентах мегалитических памятников в виде узлов графа знаний (KG). В качестве примера мы анализируем дольмены из региона Павия (Португалия). Для создания KG информация была извлечена из неструктурированных данных для заполнения модели схемы на основе концептуальной эталонной модели Международного комитета документации (CIDOC-CRM). Чтобы подготовить информацию об археологическом памятнике к массовой загрузке, она была преобразована в полуструктурированные данные. Хотя полуструктурированный файл использовался для заполнения классов соответствующими свойствами и экземплярами, метки и типы KG были определены с использованием некоторых сущностей и отношений, определенных CIDOC-CRM. Модель, основанная на знаниях, была создана для формального и структурированного представления дольменов с использованием Neo4J, базы данных графов свойств. Моделирование помеченного графа свойств на основе предопределенных меток в виде KG позволяет преобразовывать текстовые семантические данные в экземпляры и свойства. Таким образом, мы показываем, что можно представить на гранулированном уровне всю информацию о структурных компонентах памятников, поскольку неоднородности, детализация и большие объемы данных могут обрабатываться КР. Таким образом, KG, реализованный с использованием собственной графовой базы данных, может улучшить хранение и обработку данных, делая их совместимыми как между людьми, так и между людьми и машинами, а также между машинами.

граф знаний, дольмен, CIDOC-CRM, граф помеченных свойств, Neo4J

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

将 CIDOC-CRM 模型转变为巨石纪念碑财产图

本文提出了一种将巨石纪念碑的建筑组件信息存储为知识图谱（KG）中的图节点的方法。作为案例研究，我们分析了帕维亚（葡萄牙）地区的支石墓。为了构建知识图谱，我们从非结构化数据中提取信息，以填充基于国际文档委员会概念参考模型 (CIDOC-CRM) 的模式模型。为了准备批量加载考古纪念碑的信息，将其转换为半结构化数据。虽然半结构化文件用于填充类及其各自的属性和实例，但 KG 标签和类型是使用 CIDOC-CRM 定义的一些实体和关系来定义的。知识驱动模型是使用 Neo4J（一种属性图数据库）以正式且结构化的方式表示支石墓而构建的。基于预定义标签将带标签的属性图建模为知识图谱，可以将文本语义数据转换为实例和属性。因此，我们表明可以在粒度级别上表示有关纪念碑结构组件的所有信息，因为知识图谱可以处理异质性、粒度和大量数据。因此，使用原生图数据库实现的知识图谱可以改善数据存储和处理，使其能够在人与人之间、人与机器之间以及机器与机器之间进行互操作。

知识图谱、支石墓、CIDOC-CRM、标记属性图、Neo4J

Submission: posted 29 May 2023, validated 01 June 2023
Recommendation: posted 22 December 2023, validated 05 January 2024

Cite this recommendation as:
Huvila, I. (2023) Informative description of a project implementing a CIDOC-CRM based native graph database for representing megalithic information. Peer Community in Archaeology, 100338. https://doi.org/10.24072/pci.archaeo.100338

Recommendation

The paper “Transforming the CIDOC-CRM model into a megalithic monument property graph” describes an interesting endeavour of developing and implementing a CIDOC-CRM based knowledge graph using a native graph database (Neo4J) to represent megalithic information (Câmara et al. 2023). While there are earlier examples of using native graph databases and CIDOC-CRM in diverse heritage contexts, the present paper is useful addition to the literature as a detailed description of an implementation in the context of megalithic heritage. The paper provides a demonstration of a working implementation, and guidance for future projects. The described project is also documented to an extent that the paper will open up interesting opportunities to compare the approach to previous and forthcoming implementations. The same applies to the knowledge graph and use of CIDOC-CRM in the project.

Readers interested in comparing available technologies and those who are developing their own knowledge graphs might have benefited of a more detailed description of the work in relation to the current state-of-the-art and what the use of a native graph database in the built-heritage contexts implies in practice for heritage documentation beyond that it is possible and it has potentially meaningful performance-related advantages. While also the reasons to rely on using plain CIDOC-CRM instead of extensions could have been discussed in more detail, the approach demonstrates how the plain CIDOC-CRM provides a good starting point to satisfy many heritage documentation needs.

As a whole, the shortcomings relating to positioning the work to the state-of-the-art and reflecting and discussing design choices do not reduce the value of the paper as a valuable case description for those interested in the use of native graph databases and CIDOC-CRM in heritage documentation in general and the documentation of megalithic heritage in particular.

References

Câmara, A., de Almeida, A. and Oliveira, J. (2023). Transforming the CIDOC-CRM model into a megalithic monument property graph, Zenodo, 7981230, ver. 4 peer-reviewed and recommended by Peer Community in Archaeology. https://doi.org/10.5281/zenodo.7981230

PDF recommendation

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Funding:
This work was partially supported by the Fundação para a Ciência e a Tecnologia, I.P. (FCT) through the ISTAR-Iscte project UIDB/04466/2020 and UIDP/04466/2020, through the scholarship UI/BD/151495/2021.

Reviews

Evaluation round #1

DOI or URL of the preprint: https://doi.org/10.5281/zenodo.7981230

Version of the preprint: 1

Author's Reply, 08 Dec 2023

Download author's reply Download tracked changes file https://doi.org/10.24072/pci.archaeo.100338.ar1

Decision by Isto Huvila, posted 11 Oct 2023, validated 11 Oct 2023

The reviewers find the text interesting but also point out major questions and issues that would need to be thoroughly addressed before an eventual recommendation.

https://doi.org/10.24072/pci.archaeo.100338.d1

Reviewed by anonymous reviewer 1, 05 Jul 2023

In this article, a motivation for organizing archeological data in knowledge graphs is presented. The motivations and justification are well presented. But it is not a novelty per se.
It is based on an interesting case study, the presentation/explanation of the model could be improved for a publication, some elements are missing, (such as Fig.2), or are not clear (Fig. 1). Session titles, “Requirements” and “Methodology” are not quite appropriate for their session´s contents.
Fig 2. is referred to in the text, but not found in the paper, if that was referring to Table 2, it does not seem to show or exemplifies the terminology but presents references to vocabulary source.
A brief overview of the literature is given pointing out the differences in their proposal of KGs. However, they pose a too strong claim that no work was found for the representation of buildings and architectural remains in archaeology, especially aiming at the extraction, reusability, and interpretation of the information by machines, while Santos 2022 and Gergatsoulis et al. 2022 are intended for that.
It could be helpful if the “Requirements” section includes also examples of representing and querying architectural monument data and the trade-offs involved in choosing Neo4j (or any NGDBs) for the study.
The authors claim: In this paper, we only use CIDOC-CRM definitions and it is not discussed its extensions for Archaeology. Here a justification of this decision, and/or a comparison with such extensions would be required.
The CSV is available in Github, which is nice, but it requires seeing the CSV to have a better idea of the model, a description would be important.
It would be interesting to quantify the elements represented in the graphs, corresponding to the 94 dolmens analysed, including missing data, and also present some query examples, or discuss future applications in more concrete scenarios.
“Megalistimo Alentejano” and other non-English words should be in italic, for example, and the definition could be more clear for non-Portuguese speakers.
“Archaeologica Letter” has a typo and it’s not the correct translation for Carta Arqueológica.

Typos
… its components it’s based …
… interest in standartised access ...
... it’s based at the E22… -> based on?
… that allow describe the monumento ...

https://doi.org/10.24072/pci.archaeo.100338.rev11

Reviewed by anonymous reviewer 2, 03 Oct 2023

Comments by sections

Introduction

The opening sentence appears to be quite a broad statement. In fact, similar information is available in databases from numerous archaeological services throughout Europe. However, the accessibility of these databases presents a separate issue.

Requeriments

This section provides an update on existing methods for creating graphs. It is too technical for an archaeology article, what it is not a real issue. However, I don't understand how it fits into the structure of the paper, as it is closely related to the methodology

Related Literature

The cited literature may be sufficient, although many works are missing. A thorough review is certainly not expected, but the selection seems to be very carefully chosen to demonstrate the novelty of the work.
I find the last sentence of the section especially 'striking' for its imprecision, where it is claimed that none of the works have addressed a representation of architectural elements in Archaeology. As far as I know, it seems that some of these works have indeed addressed it. See for example Table 2, which contradicts your statement. Furthermore, the statement appears to be rather generalistic once again, likely not taking into account a more thorough literature search.

Methodology

Bueno Ramírez 2013 is mentioned in the text, but is not cited in the bibliography. I find it striking once again that a reference to a Spanish author is used, when in fact the quantifications of megaliths come from Portuguese authors, some of them at the University of Évora, for example, Leonor Rocha.
"Megalitismo Alentejano" is a Portuguese expression It is not necessary to use Portuguese instead of English in this case, but if that is the choice it should be in italics.
"Orthostats" is slabs in English.
The English in this section appears to be ackward in many senses. The description of what a dolmen is supposed to be is unclear. Perhaps someone who is not a specialist in the field would not understand from the description what we are referring to.
"Archaeologica Letter" is a direct translation from Portuguese and doesn't make any sense in English.
At the end of the 'Data Model' section, a dolmen is explained again, even though it has already been described earlier. The description doesn't seem to be very 'critical'; it is once more overly general and doesn't take into account the peculiarities of Portugal.

Overview of the Approach

I understand that the role of this section is purely methodological; it describes the implementation of the model and does not offer tangible results.

Conclusion and Future Work

I believe the conclusions merely repeat ideas already articulated in previous sections and do not offer anything beyond generalities that are not really useful for assessing the impact of the proposal

General comments

Illustrations

A map with the study area seems mandatory

Honestly, I understand the good intentions behind the work, but I think it suffers from many issues. It is not a proper case study; it's merely a technical proof of concept where too many elements are being attempted to be tied together, which is ultimately not reflected in either the results or the conclusions. I don't see how this structure can aid semi-automatic remote sensing (of which, by the way, many references are missing), for example. I also don't see how this can translate into a tool for interpreting the past, which is what we archaeologists are looking for. I only see its technical utility with proper development, but not its repercussions.

The structure of the paper appears to be somewhat complex. The text seems to reflect two distinct perspectives: one that is technical and proficient in the use of ontological modeling tools, and another that aims to explore these tools' applicability in the field of Archaeology. Unfortunately, the latter aspect seems quite underdeveloped in comparison to the former. This discrepancy creates a certain level of conceptual ambiguity and imprecision in terminological translation. There are also moments where the paper leans towards generalizations and may benefit from more rigorous bibliographic support. Additionally, there is no discussion comparing this work to others, such as the study by Santos et al. 2022, which focuses on the same area. Does this work represent an improvement or does it complement the previous research?Additionally, I observed that there could be more reflection on the potential utility of the tool in question. In conclusion, while the technical aspects of the paper are well-developed, the overall structure might benefit from clarification. This leads me to think that the article may be more appropriately aimed at an audience specialized in semantic models.

The data table raises several questions, both in terms of its design and the information it contains. Some of its contents appear to be 'constants,' such as the units of measurement. The treatment of chronology also seems to be less than optimal, especially considering the specific characteristics associated with this type of burial sites. The data are limited and often puzzling, such as the unknown status of the funerary chamber for all the sites.

I would suggest reviewing the English in the sections dedicated to Archaeology for clarity and accuracy. It may also be beneficial to delve deeper into the concepts, avoiding broad generalizations. A thorough review of the bibliography and its relevance could also add value to the work. Additionally, I recommend a careful reconsideration of the paper's overall structure for improved coherence.

https://doi.org/10.24072/pci.archaeo.100338.rev12

User comments

Robert Lewis, 2024-01-17 00:24:03

Thanks to the authors for sharing this graph database case study. For typical datasets in archaeology (e.g. less than 10,000 records), the difference in query performance between relational and graph databases is generally negligible. What is more significant in such cases is the ease or difficulty of writing queries to extract the desired information. The relative merits of SQL (for relational databeses) vs Cypher (for Neo4j) depend on the data and the type of question being asked. My thought is that graph databases are particularly useful for interrogating data with hierarchical structure, as with the representation of the physical components of dolmens in this paper, or with transitive relationships, as often found in chronological modelling.

or Register
Submit a preprint