Close printable page

Recommendation

A practical computational approach to stratigraphic analysis using conjoinable material culture.

Hector A. Orengo based on reviews by Robert Bischoff, Matthew Peeples and 1 anonymous reviewer

A recommendation of:

The strength of parthood ties. Modelling spatial units and fragmented objects with the TSAR method – Topological Study of Archaeological Refitting

Sébastien Plutniak (2021), OSF Preprints, q2e69, ver. 3 peer-reviewed and recommended by PCI Archaeo. https://doi.org/10.31219/osf.io/q2e69

Read preprint in preprint server Now published in a journal

Data used for results

Codes used in this study

Scripts used to obtain or analyze results

Abstract

EN

AR

ES

FR

HI

JA

PT

RU

ZH-CN

The strength of parthood ties. Modelling spatial units and fragmented objects with the TSAR method – Topological Study of Archaeological Refitting

Refitting and conjoinable pieces have long been used in archaeology to assess the consistency of discrete spatial units, such as layers, and to evaluate disturbance and post-depositional processes. The majority of current methods, despite their differences, rely on the count and proportion of refits within and between spatial units. Little attention is paid to the distribution and topology of the relationships between fragments, although this is now known to have significant effects on archaeological interpretation. This paper presents a new methodological approach for refitting studies. The TSAR approach (Topological Study of Archaeological Refitting) draws on concepts and methods from graph theory to model the network of connections observed between refitting fragments. Measures of cohesion and admixture of spatial units are defined using the structural properties of the sets of refitting relationships. To ensure reproducibility and reusability, the TSAR method is implemented as an R package, which also includes a simulator generating refitting fragments scattered in two spatial units. The advantages of the topological approach are discussed by comparing it to: 1) the results of a survey in which archaeologists were asked to rank examples of stratigraphic admixture; and 2) other computational methods. The approach is applied to simulated data, and empirical data from the Liang Abu rock shelter (East Borneo) are presented. Finally, the use of the TSAR simulation approach to test different scenarios of site formation processes is demonstrated.

refitting, graph analysis, network analysis, stratigraphy, post-depositional process, taphonomy, software

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

قوة روابط الفراق. نمذجة الوحدات المكانية والأشياء المجزأة بطريقة TSAR – دراسة طوبولوجية للتجديد الأثري

لطالما تم استخدام القطع القابلة للتجديد والتركيب في علم الآثار لتقييم اتساق الوحدات المكانية المنفصلة، مثل الطبقات، ولتقييم الاضطراب وعمليات ما بعد الترسيب. تعتمد غالبية الطرق الحالية، على الرغم من اختلافاتها، على عدد ونسبة التجديدات داخل الوحدات المكانية وفيما بينها. تم إيلاء القليل من الاهتمام لتوزيع وطوبولوجيا العلاقات بين الأجزاء، على الرغم من أنه من المعروف الآن أن هذا له تأثيرات كبيرة على التفسير الأثري. تقدم هذه الورقة نهجا منهجيا جديدا لتجديد الدراسات. يعتمد نهج TSAR (الدراسة الطوبولوجية للتجديد الأثري) على مفاهيم وأساليب من نظرية الرسم البياني لنمذجة شبكة الاتصالات التي تمت ملاحظتها بين أجزاء التجديد. يتم تحديد مقاييس التماسك والاختلاط بين الوحدات المكانية باستخدام الخصائص الهيكلية لمجموعات علاقات التجديد. ولضمان إمكانية التكرار وإعادة الاستخدام، يتم تنفيذ طريقة TSAR كحزمة R، والتي تتضمن أيضًا جهاز محاكاة يولد أجزاء تجديد منتشرة في وحدتين مكانيتين. تتم مناقشة مزايا النهج الطوبولوجي من خلال مقارنته بما يلي: 1) نتائج المسح الذي طُلب فيه من علماء الآثار ترتيب أمثلة على الاختلاط الطبقي؛ و 2) الطرق الحسابية الأخرى. يتم تطبيق هذا النهج على البيانات المحاكاة، ويتم عرض البيانات التجريبية من ملجأ ليانغ أبو الصخري (شرق بورنيو). وأخيرًا، تم توضيح استخدام أسلوب محاكاة TSAR لاختبار سيناريوهات مختلفة لعمليات تشكيل الموقع.

التجديد، تحليل الرسوم البيانية، تحليل الشبكات، علم الطبقات، عملية ما بعد الترسيب، علم النقرات، البرمجيات

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

La fuerza de los lazos de parcialidad. Modelado de unidades espaciales y objetos fragmentados con el método TSAR – Estudio Topológico de Rehabilitación Arqueológica

Las piezas reacondicionadas y unibles se han utilizado durante mucho tiempo en arqueología para evaluar la consistencia de unidades espaciales discretas, como capas, y para evaluar perturbaciones y procesos posdeposicionales. La mayoría de los métodos actuales, a pesar de sus diferencias, se basan en el recuento y la proporción de reacondicionamientos dentro y entre unidades espaciales. Se presta poca atención a la distribución y topología de las relaciones entre fragmentos, aunque ahora se sabe que esto tiene efectos significativos en la interpretación arqueológica. Este artículo presenta un nuevo enfoque metodológico para los estudios de refitting. El enfoque TSAR (Estudio Topológico de Refitting Arqueológico) se basa en conceptos y métodos de la teoría de grafos para modelar la red de conexiones observadas entre fragmentos de refitting. Las medidas de cohesión y mezcla de unidades espaciales se definen utilizando las propiedades estructurales de los conjuntos de relaciones de reajuste. Para garantizar la reproducibilidad y la reutilización, el método TSAR se implementa como un paquete R, que también incluye un simulador que genera fragmentos de reacondicionamiento dispersos en dos unidades espaciales. Las ventajas del enfoque topológico se discuten comparándolo con: 1) los resultados de un estudio en el que se pidió a los arqueólogos que clasificaran ejemplos de mezcla estratigráfica; y 2) otros métodos computacionales. El enfoque se aplica a datos simulados y se presentan datos empíricos del refugio rocoso de Liang Abu (Borneo Oriental). Finalmente, se demuestra el uso del enfoque de simulación TSAR para probar diferentes escenarios de procesos de formación de sitios.

reacondicionamiento, análisis de gráficos, análisis de redes, estratigrafía, proceso post-deposicional, tafonomía, software

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

La force des liens de solidarité. Modélisation d'unités spatiales et d'objets fragmentés avec la méthode TSAR – Etude topologique des réaménagements archéologiques

Les pièces de réaménagement et de jonction sont utilisées depuis longtemps en archéologie pour évaluer la cohérence d'unités spatiales discrètes, telles que les couches, et pour évaluer les perturbations et les processus post-dépôt. La majorité des méthodes actuelles, malgré leurs différences, reposent sur le nombre et la proportion de radoubs au sein et entre les unités spatiales. Peu d'attention est accordée à la distribution et à la topologie des relations entre fragments, même si l'on sait désormais que cela a des effets significatifs sur l'interprétation archéologique. Cet article présente une nouvelle approche méthodologique pour le réaménagement des études. L'approche TSAR (Topological Study of Archaeological Refitting) s'appuie sur les concepts et méthodes de la théorie des graphes pour modéliser le réseau de connexions observé entre les fragments de refitting. Les mesures de cohésion et de mélange d'unités spatiales sont définies à l'aide des propriétés structurelles des ensembles de relations de réaménagement. Pour garantir la reproductibilité et la réutilisabilité, la méthode TSAR est implémentée sous forme de package R, qui comprend également un simulateur générant des fragments de réaménagement dispersés dans deux unités spatiales. Les avantages de l'approche topologique sont discutés en la comparant : 1) aux résultats d'une enquête dans laquelle les archéologues ont été invités à classer des exemples de mélange stratigraphique ; et 2) d'autres méthodes de calcul. L'approche est appliquée à des données simulées et des données empiriques de l'abri sous roche de Liang Abu (Est de Bornéo) sont présentées. Enfin, l'utilisation de l'approche de simulation TSAR pour tester différents scénarios de processus de formation de sites est démontrée.

réaménagement, analyse de graphes, analyse de réseaux, stratigraphie, processus post-dépôt, taphonomie, logiciels

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

पार्थहुड संबंधों की ताकत. टीएसएआर विधि के साथ स्थानिक इकाइयों और खंडित वस्तुओं की मॉडलिंग - पुरातात्विक रीफिटिंग का टोपोलॉजिकल अध्ययन

परतों जैसी असतत स्थानिक इकाइयों की स्थिरता का आकलन करने और गड़बड़ी और जमाव के बाद की प्रक्रियाओं का मूल्यांकन करने के लिए पुरातत्व में रीफिटिंग और संयोजित टुकड़ों का लंबे समय से उपयोग किया जाता रहा है। मौजूदा तरीकों में से अधिकांश, अपने मतभेदों के बावजूद, स्थानिक इकाइयों के भीतर और उनके बीच रिफिट की गिनती और अनुपात पर निर्भर करते हैं। टुकड़ों के बीच संबंधों के वितरण और टोपोलॉजी पर थोड़ा ध्यान दिया जाता है, हालांकि अब यह पुरातात्विक व्याख्या पर महत्वपूर्ण प्रभाव डालने के लिए जाना जाता है। यह पेपर रिफ़िटिंग अध्ययन के लिए एक नया पद्धतिगत दृष्टिकोण प्रस्तुत करता है। टीएसएआर दृष्टिकोण (पुरातात्विक रीफिटिंग का टोपोलॉजिकल अध्ययन) रीफिटिंग टुकड़ों के बीच देखे गए कनेक्शन के नेटवर्क को मॉडल करने के लिए ग्राफ सिद्धांत से अवधारणाओं और तरीकों पर आधारित है। स्थानिक इकाइयों के सामंजस्य और मिश्रण के उपायों को रिफिटिंग संबंधों के सेट के संरचनात्मक गुणों का उपयोग करके परिभाषित किया गया है। प्रतिलिपि प्रस्तुत करने योग्यता और पुन: प्रयोज्यता सुनिश्चित करने के लिए, टीएसएआर विधि को आर पैकेज के रूप में लागू किया गया है, जिसमें दो स्थानिक इकाइयों में बिखरे हुए रीफिटिंग टुकड़े उत्पन्न करने वाला एक सिम्युलेटर भी शामिल है। टोपोलॉजिकल दृष्टिकोण के फायदों की तुलना इसकी तुलना करके की जाती है: 1) एक सर्वेक्षण के परिणाम जिसमें पुरातत्वविदों को स्ट्रैटिग्राफिक मिश्रण के उदाहरणों को रैंक करने के लिए कहा गया था; और 2) अन्य कम्प्यूटेशनल तरीके। दृष्टिकोण को सिम्युलेटेड डेटा पर लागू किया जाता है, और लियांग अबू रॉक शेल्टर (पूर्वी बोर्नियो) से अनुभवजन्य डेटा प्रस्तुत किया जाता है। अंत में, साइट निर्माण प्रक्रियाओं के विभिन्न परिदृश्यों का परीक्षण करने के लिए टीएसएआर सिमुलेशन दृष्टिकोण का उपयोग प्रदर्शित किया गया है।

रीफ़िटिंग, ग्राफ़ विश्लेषण, नेटवर्क विश्लेषण, स्ट्रैटिग्राफी, पोस्ट-डिपोज़िशनल प्रक्रिया, टैफ़ोनोमी, सॉफ़्टवेयर

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

仲間の絆の強さ。 TSAR 法を使用した空間単位と断片化されたオブジェクトのモデリング – 考古学的な改修のトポロジカルな研究

再取り付けおよび結合可能な断片は、層などの個別の空間単位の一貫性を評価したり、撹乱や堆積後のプロセスを評価したりするために、考古学で長い間使用されてきました。現在の方法の大部分は、その違いはあるものの、空間単位内および空間単位間の再調整の数と割合に依存しています。断片間の関係の分布やトポロジーにはほとんど注意が払われていませんが、これが考古学的解釈に重大な影響を与えることが現在では知られています。この論文では、研究を再調整するための新しい方法論的アプローチを紹介します。 TSAR アプローチ (考古学的再調整のトポロジカル研究) は、グラフ理論の概念と方法を利用して、再調整された断片間で観察される接続ネットワークをモデル化します。空間単位の凝集性と混合性の尺度は、再フィッティング関係のセットの構造特性を使用して定義されます。再現性と再利用性を確保するために、TSAR メソッドは R パッケージとして実装されており、これには 2 つの空間ユニットに散在する再フィッティングフラグメントを生成するシミュレーターも含まれています。トポロジー的アプローチの利点は、以下と比較することによって議論されます。 1) 考古学者が層位学的混合の例をランク付けするよう依頼された調査の結果。 2) 他の計算方法。このアプローチはシミュレーションデータに適用され、リャンアブ岩石シェルター (東ボルネオ) からの実証データが提示されます。最後に、TSAR シミュレーションアプローチを使用して、サイト形成プロセスのさまざまなシナリオをテストする方法を示します。

再フィッティング、グラフ解析、ネットワーク解析、層序学、堆積後プロセス、タフォノミー、ソフトウェア

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

A força dos laços de parceria. Modelagem de unidades espaciais e objetos fragmentados com o método TSAR – Estudo Topológico de Reequipamento Arqueológico

Peças de reajuste e junções têm sido usadas há muito tempo em arqueologia para avaliar a consistência de unidades espaciais discretas, como camadas, e para avaliar perturbações e processos pós-deposicionais. A maioria dos métodos atuais, apesar das suas diferenças, baseiam-se na contagem e proporção de reajustes dentro e entre unidades espaciais. Pouca atenção é dada à distribuição e topologia das relações entre os fragmentos, embora se saiba agora que isso tem efeitos significativos na interpretação arqueológica. Este artigo apresenta uma nova abordagem metodológica para reequipamento de estudos. A abordagem TSAR (Estudo Topológico de Reequipamento Arqueológico) baseia-se em conceitos e métodos da teoria dos grafos para modelar a rede de conexões observadas entre fragmentos de reequipamento. As medidas de coesão e mistura de unidades espaciais são definidas utilizando as propriedades estruturais dos conjuntos de relações de reajuste. Para garantir a reprodutibilidade e reutilização, o método TSAR é implementado como um pacote R, que também inclui um simulador que gera fragmentos de reequipamento espalhados em duas unidades espaciais. As vantagens da abordagem topológica são discutidas comparando-a com: 1) os resultados de uma pesquisa na qual os arqueólogos foram solicitados a classificar exemplos de mistura estratigráfica; e 2) outros métodos computacionais. A abordagem é aplicada a dados simulados, e são apresentados dados empíricos do abrigo rochoso Liang Abu (Bornéu Oriental). Finalmente, é demonstrado o uso da abordagem de simulação TSAR para testar diferentes cenários de processos de formação de sítios.

reequipamento, análise gráfica, análise de redes, estratigrafia, processo pós-deposicional, tafonomia, software

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

Сила партпартийных связей. Моделирование пространственных единиц и фрагментированных объектов методом TSAR — Топологическое исследование археологического переоборудования

Перестановка и соединение частей уже давно используются в археологии для оценки согласованности отдельных пространственных единиц, таких как слои, а также для оценки нарушений и процессов после отложений. Большинство современных методов, несмотря на их различия, основаны на подсчете и доле ремонтов внутри и между пространственными единицами. Мало внимания уделяется распределению и топологии взаимоотношений между фрагментами, хотя сейчас известно, что это оказывает существенное влияние на археологическую интерпретацию. В данной статье представлен новый методологический подход к исследованиям переоборудования. Подход TSAR (топологическое исследование археологической переоборудования) опирается на концепции и методы теории графов для моделирования сети связей, наблюдаемых между фрагментами переоборудования. Меры сцепления и смешения пространственных единиц определяются с использованием структурных свойств наборов переоснащающих отношений. Для обеспечения воспроизводимости и возможности повторного использования метод TSAR реализован в виде пакета R, в который также входит симулятор, генерирующий фрагменты переоборудования, разбросанные в двух пространственных единицах. Преимущества топологического подхода обсуждаются путем сравнения его с: 1) результатами исследования, в ходе которого археологам было предложено ранжировать примеры стратиграфической примеси; и 2) другие вычислительные методы. Подход применен к смоделированным данным, и представлены эмпирические данные из каменного убежища Лян Абу (Восточное Борнео). Наконец, демонстрируется использование подхода моделирования TSAR для тестирования различных сценариев процессов формирования сайтов.

переоборудование, графовый анализ, сетевой анализ, стратиграфия, постседиментационный процесс, тафономия, программное обеспечение

This is an automatically generated version. The authors and PCI decline all responsibility concerning its content

伙伴关系的强度。用TSAR方法对空间单元和碎片物体进行建模——考古改装的拓扑研究

考古学中长期以来一直使用改装和可连接部件来评估离散空间单元（例如层）的一致性，并评估干扰和沉积后过程。目前的大多数方法尽管存在差异，但都依赖于空间单元内和空间单元之间改装的数量和比例。人们很少关注碎片之间关系的分布和拓扑，尽管现在已知这对考古学解释有重大影响。本文提出了改装研究的新方法。 TSAR 方法（考古改装的拓扑研究）借鉴图论的概念和方法，对改装碎片之间观察到的连接网络进行建模。空间单元的凝聚力和混合度的测量是使用整修关系组的结构属性来定义的。为了确保可重复性和可重用性，TSAR 方法以 R 包的形式实现，其中还包括一个模拟器，生成分散在两个空间单元中的改装片段。通过将拓扑方法与以下内容进行比较来讨论拓扑方法的优点：1）一项调查结果，其中考古学家被要求对地层混合物的例子进行排序； 2）其他计算方法。该方法应用于模拟数据，并提供了梁阿布岩石庇护所（东婆罗洲）的经验数据。最后，演示了使用TSAR模拟方法来测试不同场景的站点形成过程。

改装、图形分析、网络分析、地层学、沉积后过程、埋藏学、软件

Submission: posted 14 January 2021
Recommendation: posted 24 June 2021, validated 24 June 2021

Cite this recommendation as:
Orengo, H. (2021) A practical computational approach to stratigraphic analysis using conjoinable material culture.. Peer Community in Archaeology, 100010. https://doi.org/10.24072/pci.archaeo.100010

Recommendation

The paper by Plutniak [1] presents a new method that uses refitting to help interpret stratigraphy using the topological distribution of conjoinable material culture. This new method opens up new avenues to the archaeological use of network analysis but also to assess the integrity of interpreted excavation layers. Beyond its evident applicability to standard excavation practice, the paper presents a series of characteristics that exemplify archaeological publication best practices and, as someone more versed in computational than in refitting studies I would like to comment upon.

It was no easy task to find adequate reviewers for this paper as it combines techniques and expertise that are not commonly found together in individual researchers. However, Plutniak, with help from three reviewers, particularly M. Peeples, a leading figure in archaeological applications of network science, makes a considerable effort to be accessible to non-specialist archaeologists. The core Topological Study of Archaeological Refitting (TSAR) method is freely accessible as the R package archeofrag, which is available at the Comprehensive R Archive Network (https://CRAN.R-project.org/package=archeofrag) that can be applied without the need to understand all its mathematical, graph theory and coding aspects. Beside these, an online interface including test data has been provided (https://analytics.huma-num.fr/Sebastien.Plutniak/archeofrag/), which aims to ease access to the method to those archaeologists inexperienced with R. Finally, supplementary material showing how to use the package and evaluating its potential through excellent examples is provided as both pdf and Rmw (Sweave) files. This is an important companion for the paper as it allows a better understanding of the methods presented in the paper and its practical application.

The author shows particular care in testing the potential and capabilities of the method. For example, a function is provided “frag.observer.failure” to test the robustness of the edge count method against the TSAR method, which is able to prove that TSAR can deal well with incomplete information. As a further step in this direction both simulated and real field-acquired data are used to test the method which further proves that archeofrag is not only able to quantitatively assess the mixture of excavated layers but to propose meaningful alternatives, which no doubt will add an increased methodological consistency and thoroughness to previous quantitative approaches to material refitting work, even when dealing with very complex stratigraphies.

All in all, this paper makes an important contribution to core archaeological practice through the use of innovative, reproducible and accessible computational methods. I fully endorse it for the conscious and solid methods it presents but also for its adherence to open publication practices. I hope that it can become of standard use in the reconstruction of excavated stratigraphical layers through conjoinable material culture.

[1] Plutniak, S. 2021. The Strength of Parthood Ties. Modelling Spatial Units and Fragmented Objects with the TSAR Method – Topological Study of Archaeological Refitting. OSF Preprints, q2e69, ver. 3 Peer-reviewed and recommended by PCI Archaeology. https://doi.org/10.31219/osf.io/q2e69.

PDF recommendation

Conflict of interest:
The recommender in charge of the evaluation of the article and the reviewers declared that they have no conflict of interest (as defined in the code of conduct of PCI) with the authors or with the content of the article. The authors declared that they comply with the PCI rule of having no financial conflicts of interest in relation to the content of the article.

Funding:
no declaration

Reviews

Evaluation round #1

DOI or URL of the preprint: https://osf.io/q2e69

Author's Reply, 28 Apr 2021

I thank the three reviewers for their valuable comments and suggestions, which helped a lot to improve this paper. In addition to the change suggested by the reviewers (which are detailed below), this second version comes with all the required suppelmentary material, including a new version of the archeofrag package (v0.7).

I would also appreciate the Rmd file as an additional supplemental document for easier reproducibility, but the code can be copied from the pdf

The Rmd file has been added to the supplementary materials. I also checked and ensured that all the materials needed to reproduce the content of this paper are provided.

1) On Page 3, the first Figure and the discussion related to it are somewhat abstract. I would suggest to add a real-world archaeological example to explain to the dumb folks like me what this actually means in a concrete archaeological example. Otherwise the abstraction might scare people off.

A concrete example is given in Figure 2. It might be better to have Figure 2 before Figure 1, but this is not optimal for the sake of the presentation. I assume that if things are unclear to the reader in Figure 1, it should become clear few paragraphs later with Figure 2.

2) It would be nice, although not absolutely necessary, to see how this method could be deployed in conjunction with other methods of stratigraphic analysis, as well as Bayesian modelling of radiocarbon dates. I would think that this method would be extremely useful in fine-tuning the modelling of C14 dates. It would be useful if the author could maybe comment on that and give an idea how this method might be used in combination with others?

This is an interesting suggestion, but far beyond the scope of this paper. Development of the TSAR method (or based on this method) are ongoing and not presented in this first paper. For example, the development version of the archeofrag package includes (from version 0.6.6, see https://github.com/sebastien-plutniak/archeofrag) new methods to weight the edges based on morphometric values of the objects and on the spatial distance between them.

3) I guess this method is still under development, but I wonder if the author might be able to comment a little more on how we could make this method something more commonplace? At the moment, it seems fairly inaccessible, so what will be the pathway to making it more accessible to a wide range of archaeologists (especially those like me who are a bit challenged by the maths)?

This paper is intended to give an in-depth presentation of a method that can be used without a complete understanding of all the aspects addressed in this paper. The "archeofrag" R package offers a set of functions to apply the method without going into its mathematical aspects. Given that not all archaeologists manage the R language, I also developed an online interface from which all users can experiment (https://analytics.huma-num.fr/Sebastien.Plutniak/archeofrag). This online application, with few options for the moment, will be further developed in the future.

The “preliminary definitions” section includes some useful information, but I also think this could be reduced not just to save space but also to add to the clarity of the argument.

The reference to connections relying on “solid-state physics” and then the next sub-section with the discussion of Queen and Rook contiguity and the analogy to spatial analysis made a pretty clear distinction a bit more muddled in my reading.
I think a very simple definition of connection and similarity refits and Figure 2 is plenty and felt that the Queen/Rook example did not add clarity and was not referred to again so could be removed.

section 2.1.3 Topological properties could be basically eliminated

I added a figure and revised the text for clarity. It seems important to keep the paragraphs on contiguity, because it is an important aspect when recording the data for refitting analysis. Consequently, I think it must be addressed in the main body of the text (and not in the supplementary material), because the user of the TSAR method will face this problem.

section 2.1.2 could be simplified to make the two basic points that are necessary for the rest of the paper

1) archaeologists identify potential refits by either identifying pieces that fit together across substantial surfaces in 3D space or alternatively by looking for similarities in the objects themselves

2) you are only focused on the former in this paper.

I simplified this paragraph by removing the part on the epistemological aspects of the two types of relationships.

It wasn’t clear to me how the “merge graphs” step in the two initial layers works. I can think of several ways you might have chosen to implement this, but I do not think the choices you made are completely clear in this draft or the supplement. I think a short paragraph would be needed to address this.

I rephrased this part, hoping that it is clear now. About the way the algorithm works, I also added a description in pseudo-code, which complements the flowchart diagram. The reader interested in learning more about it can read the code of the function, included in the archeofrag package and written with abundant comments.

the degree to which missing connections would hamper our ability to connect empirical patterns to generative processes. Given the nature of fragmentation and post-depositional processes, I expect that it would be common for archaeologists to miss refits due to damage to potential conjoining edges of pieces or all manner of other processes. How would this influence your results? I think this issue merits some discussion at least and perhaps some small analyses.

I had one suggestion: missing connections could potentially be evaluated with an additional global parameter such as a probability that a connection is removed once the graphs have been formed to essentially simulate “missed” or unidentifiable connections. Especially with the focus on “connections” rather than similarity, I think it’s likely that missed connections would be present in any empirical refit study and this simulation model provides a setup that could help evaluate the impacts.

Thank you for this very inspiring suggestion. I developed a new function in the package to simulate missed connections (the "frag.observer.failure" function), and used it to test the robustness of the edge count method versus the TSAR method. This gave a strong and additional argument supporting the TSAR method, demonstrating that it is more robust and less sensible to the “lack” of information. I shortly reported these results in section 3.2.2 (to not overload the main text) and referred to the supplementary materials for details.

section 3.1.1 you compare the admixture model and others to the archaeological intuition. Given the variability in the archaeological intuitive coding, I don’t know that the methods used here fitting in the same cluster is a particularly convincing argument in favor of these new methods (and that’s okay).

To me, the results you present suggest that people are generally not great at evaluating the relative order of topological patterns and that, in and of itself, argues for some sort of automated approach like you’ve developed here.

The clustering diagram and the discussion in the text is a bit confusing, however, as it stands as not every bar in the cluster diagram is labeled and it’s unclear which “secondary clusters” you are referring to in the text.

Beyond this, the Figure 8 caption says you are evaluating the four methods but it’s not entirely clear to me which is represented by which label in the existing cluster tree diagram. I think this figure will need to be modified.

I modified the figure and its caption, making them more clear, and revised the text as suggested.

The results discussed and shown in Figure 9 are helpful for thinking about how different values for these variables change with variation in balance and disturbance. I think this figure could be improved with a bit more guiding text. The x-axis is labeled Cohesion but Admixture is shown on this same axis with the same 0-1 scale so I would suggest changing that. It would also help to label the rows and columns with balance and disturbance respectively. Finally, I would suggest just increasing the border for the Admixture color as it isn’t really visible in the plots further to the left.

Also, I think this is an excellent place for you to return to your Table 1 insights and discuss how you might interpret various of these plots in light of that table.

I improved the readability of the figure as suggested (I modified the labels and increased the border of the red shades representing the admixture IQR). The caption of this figure and the text have been re-written, with reference to Table 1 (which has been modified, limiting its use for direct interpretation in terms of post-depositional processes). The correspondence between the Figure with simulated results and the Table has also been enhanced by reordering their rows and colums in similar fashion.

Figure 10: It would be interesting if these were also associated with a non-parametric correlation test like Spearman’s rho just to get a sense of how they differ in terms of rank order correlations.

Consider splitting Figure 11 into two figures so that the labels in the boxplot are clearer.

I modified the figure but kept the two plot alongside (to avoid too many figures and, above all, because they represent the same data and it makes sense to have them next to each other — I would have preferred to use margin boxplots, but there is a limitation in ggplot2, it is not possible to use facets and margins plots at the same time). In addition, as suggested, I computed the Spearman coefficients on the ranks of the 10 theoretical graphs computed by the four methods. A table reports the results, which are also discussed in the text.

The empirical example from Liang Abu is informative for thinking about how this all would actually work in practice. Notably, the values for cohesion differ in a pretty obvious way but the other three measures (excluding Modularity) show pretty minor differences (and all show 0 for the 0&1 comparison which makes sense). How might you think about these results in relation to your schema in table 1? Is the difference in Cohesion substantial? What would you interpret as a meaningful difference here? Discussing this a bit in section 3.3.1 would be helpful in walking a reader through what you’ve done and provide a justification for the hypothesis testing and simulation you do in the next section.

The discussion in this paragraph has been revised and completed accordingly, with reference to Table 1 as suggested and strengthening the relation with the use of simulation in the next section.

The hypothesis testing is an important part of the overall argument and I have some questions and need a bit of clarification in a couple of places here. I followed along with your results and also replicated them in R using the code in the supplement. I found the presentation of these results to be fairly brief given that some major interpretations hinge on them. The set up makes sense as your question is basically whether you could consider this context as having one or two layers initially. You use Wilcox test to assess rank order differences in several variables, which seems like a sensible approach and find evidence for differences in the simulated distributions and not others. Given that you have a low but non-null admixture you are really conducting these simulations to assess differences in cohesion in terms of the schema you laid out in table 1. Since scenario 3 and scenario 4 have the same interpretation (and admixture doesn’t differ for either scenario) how do the results you present in this section and Figures 12 and 13 related to the setup you initially provided in table 1? I feel like this needs more discussion. The results for specific variables differ in terms of which hypothesis is more or less supported. The results seem equivocal for several variables and looking at the supplement although there are statistically significant rank order shifts the distributions are quite similar for some variables suggesting the actual effect is small. You land on an interpretation of two independent layers but I think this needs to be better justified. Given the low admixture value and the setup in table 1, is there a scenario that would have generated different results?

Thank you for pointing this. I added a table providing a “grid” for interpretation (Table 7). The plots showing the simulated and empirical results by parameter now also include a boxplot, and the Interquartile range is used to get a better reading of the relation between the empirical value and the simulated values, and finally a more objective interpretation. However, as you suggest in your comment, the interpretation is (must be) left to the archaeologist, since the final assertion is qualitative. I stress this point in the text and present a more prudent interpretation for the Liang Abu layers 1 and 2 case study.

I would also be interested in seeing an exploration of the relationship between structural admixture and the absolute difference in cohesion for a range of scenarios.

this empirical example is only able to capture a small range of the possible scenarios that might be encountered in real settings. In general, I would like to see what kind of numbers would generate higher admixture values and how those might compare to what is shown here.

For sure, the Liang Abu case study is limited. To present a wider range of possibilities is the role of Figure 9 (now Figure 10), unless I do not understand exactly what do you mean here.

In general, the set up in the beginning of the paper outlines the relationship between admixture and cohesion in table 1 but this doesn’t come in to play in the interpretation of the empirical example and I think this makes the final interpretation of the hypothesis test less impactful. I would suggest returning to the table 1 set up in the hypothesis testing section and perhaps even discussing in more detail other realistic simulated datasets that would land you somewhere else on table 1.

I think the major thing that could be improved is the connection between the initial discussion of the method (section 2.2 in particular) and the hypothesis testing and simulation in the end.

I tried to satisfy this general demand (see answers to the previous points).

In addition to the text and supplement, I also installed and reviewed the R package. I was able to install this but I had to roll back to and earlier (3.6) version of R as one dependency was not available for the most recent version (4.0) of R. If that is an easy fix (looking at the BiocManager packages) it would be helpful to potential future users. As for the supplement, I was not able to fully replicate those results because there are a couple of files that were called in the code that weren’t provided on the OSF link as the markdown document in pdf was the only document posted. I think it would be useful to post the supplement as an Rmd file along with all of the required files to completely replicate the results. Once I was able to get the package installed in R 3.6, I was able to run all of the functions without any problems.

The compatibility of the archeofrag package is checked with continuous integration procedues (on github for macOS, Windows, and linux (Ubuntu 20.04); here: https://github.com/sebastien-plutniak/archeofrag/actions/runs/785741281, and on Travis CI for Linux Ubuntu (18.04.5 LTS), here: https://travis-ci.org/github/sebastien-plutniak/archeofrag). I hope that other users will not encounter troubles and will continue to maintain the package in the future.

https://doi.org/10.24072/pci.archaeo.100156.ar1

Decision by Hector A. Orengo, posted 04 Mar 2021

Dear Dr Plutniak,

Thank you very much for your submission to PCI Archaeology. Apologies for the amount of time it took to contact you after your initial submission. It has been very difficult to find adequate reviewers for your paper as it combines techniques and expertise that are not commonly found together in individual researchers.

I personally find the paper innovative and of interest to PCI Archaeology readers and users. After reading the positive reviews I would like to recommend it for publication after some minor issues, as detailed by reviewers 2 and 3, have been addressed. These changes should not take much of your time as they seem relatively minor.

I hope you decide to provide a corrected version of your manuscript.

All best wishes,

Hector

https://doi.org/10.24072/pci.archaeo.100156.d1

Reviewed by Robert Bischoff, 09 Feb 2021

This paper describes a novel method to assess the integrity and amount of mixing between excavation layers. The method, termed the TSAR approach (Topological Study of Archaeological Refitting), uses network typologies and edge metrics to assess the level of cohesion and admixture between two layers. The nodes (artifacts) are connected based on object refitting and adjusts for sample size. The utility of this method compared to other approaches is tested using simulated data and excavation data. The author created a package in the R programming language to facilitate the method, which is freely available from R’s package archive and includes some of the data used in the article. Furthermore, the R package allows users to generate their own test scenarios. Assessing stratigraphic integrity is often a critical task for interpretation. Refitting alone is enough to determine whether some mixing occurred but a quantitative assessment of the mixing requires a more complex methodology. This is the problem addressed by the author. I found the overview of current methods and the comparison to the proposed TSAR method convincing and reasonable. The application of network methods is, to my knowledge, a novel approach that solves a challenging problem and reveals interpretable results. The use of both simulated data and field data to test the method strengthens the author’s argument. As stated in the article, testing this method in additional scenarios is necessary but not required to justify recommending this paper. I applaud the inclusion of a pdf containing the code used for analysis and explaining the steps. The development and publishing of an R package based on this method is a significant contribution. I would also appreciate the Rmd file as an additional supplemental document for easier reproducibility, but the code can be copied from the pdf. I recommend endorsing this paper and found no major concerns that should be addressed. There are a few minor corrections I have suggested included in the pdf of the article that I have submitted.

Download the review https://doi.org/10.24072/pci.archaeo.100156.rev11

Reviewed by anonymous reviewer 1, 08 Feb 2021

I think this is a valuable paper that should be published. The paper proposes a novel method to evaluate the stratigraphic integrity of archaeological deposits by combining refitting studies with graph theory. Usually, refitting work has been an ad hoc process providing some clues for excavators on how archaeological deposits might be related. This paper add significant methodological rigor to realise the full potential of refitting studies to assess the stratigraphy at complex sites. The most immediate relevance is probably for Palaeolithic cave sites and the like, but the method is potentially relevant to a wide range of situations.

I have to point out though I am not a statistician or particularly well versed in mathematics or graph theory. The calculations in the paper are therefore a bit beyond my abilities. I therefore hope that another reviewer was assigned to this paper that has this expertise and can comment on the mathematical aspects in more detail.

Three points I would like to raise are these: 1) On Page 3, the first Figure and the discussion related to it are somewhat abstract. I would suggest to add a real-world archaeological example to explain to the dumb folks like me what this actually means in a concrete archaeological example. Otherwise the abstraction might scare people off. 2) It would be nice, although not absolutely necessary, to see how this method could be deployed in conjunction with other methods of stratigraphic analysis, as well as Bayesian modelling of radiocarbon dates. I would think that this method would be extremely useful in fine-tuning the modelling of C14 dates. It would be useful if the author could maybe comment on that and give an idea how this method might be used in combination with others? 3) I guess this method is still under development, but I wonder if the author might be able to comment a little more on how we could make this method something more commonplace? At the moment, it seems fairly inaccessible, so what will be the pathway to making it more accessible to a wide range of archaeologists (especially those like me who are a bit challenged by the maths)?

In conclusion, I think this is a solid paper and I would recommend publishing it. It can be published as it is, but perhaps the author can take some of the comments I made above into consideration.

https://doi.org/10.24072/pci.archaeo.100156.rev12

Reviewed by Matthew Peeples, 19 Feb 2021

Review of “The strength of parthood ties. Evaluating archaeological layers using graph theory to model objects refitting” by Sébastien Plutniak

This paper presents a novel approach to assessing the mixing of archaeological layers using refitting analysis and the topology of refits. This is certainly different than most previous approaches which, as the author notes, rely largely on assessing counts of refits but not the structure connections among them. I think this approach is interesting and has the potential to be useful in helping researchers think thorough and simulate different scenarios involving fragmentation, deposition, and post-depositional processes to aid in the interpretation of real archaeological data. There are some changes I would suggest that I think could help to improve the clarity and usability of this manuscript and I outline those in detail here. I go through each section of the paper and then return to some overarching comments on the method and presentation at the end. The introductory section to the paper is clear and presents a good justification for why this approach could be useful and how it expands upon the existing literature. I am not particularly familiar with the body of literature cited on refits and disturbance processes, but the article cites several classic papers and a range of newer studies that capture much of the literature I have seen.

The “preliminary definitions” section includes some useful information, but I also think this could be reduced not just to save space but also to add to the clarity of the argument. The distinction between “connection” and “similarity” is useful but I found the discussion of this distinction a bit confusing without a second glance. The reference to connections relying on “solid-state physics” and then the next sub-section with the discussion of Queen and Rook contiguity and the analogy to spatial analysis made a pretty clear distinction a bit more muddled in my reading. I section 2.1.3 could be basically eliminated and section 2.1.2 could be simplified to make the two basic points that are necessary for the rest of the paper 1) archaeologists identify potential refits by either identifying pieces that fit together across substantial surfaces in 3D space or alternatively by looking for similarities in the objects themselves and 2) you are only focused on the former in this paper. I think a very simple definition of connection and similarity refits and Figure 2 is plenty and felt that the Queen/Rook example did not add clarity and was not referred to again so could be removed.

Section 2.2 was straight forward and Table 1/Figure 4 are helpful. The method chosen for edge weighting makes sense and is clearly described with the equation as are the definitions for cohesion and admixture. The measures selected for assessing the topological properties of refits make intuitive sense as do the alternative methods and the criticisms of them (such as the issue of sample size and modularity).

The TSAR simulator is a sensible approach to evaluating the potential generative processes creating empirical patterns. There are several extensions that I could think of to this procedure (and some of those are mentioned in the conclusions) but this model is a good beginning. The parameter descriptions in table 2 and the flowcharts in Figure 6 make sense and help me understand what is happening under the hood with one exception. Reading this text, it wasn’t clear to me how the “merge graphs” step in the two initial layers works. I can think of several ways you might have chosen to implement this, but I do not think the choices you made are completely clear in this draft or the supplement. I think a short paragraph would be needed to address this.

One major question I have about this approach that is not discussed here or elsewhere is the degree to which missing connections would hamper our ability to connect empirical patterns to generative processes. Given the nature of fragmentation and post-depositional processes I expect that it would be common for archaeologists to miss refits due to damage to potential conjoining edges of pieces or all manner of other processes. How would this influence your results? I think this issue merits some discussion at least and perhaps some small analyses. I had one suggestion the author could consider here. Missing connections could potentially be evaluated with an additional global parameter such as a probability that a connection is removed once the graphs have been formed to essentially simulate “missed” or unidentifiable connections. Especially with the focus on “connections” rather than similarity, I think it’s likely that missed connections would be present in any empirical refit study and this simulation model provides a setup that could help evaluate the impacts. I’d be interested in how this influences the shape of distributions in Figure 9, for example.

Moving on to the examples, I think the theoretical graphs, the Liang Abu empirical example, and simulation based on the empirical example provide a good three-pronged approach to evaluating the technique. I have some questions about the results that I think could require some clarification in the text.

For the first analysis where archaeologists were asked to sort the theoretical graphs you got results that were extremely variable which is not surprising to me because it seems a pretty difficult task to evaluate these kinds of topological patterns on sight. In section 3.1.1 you compare the admixture model and others to the archaeological intuition. Given the variability in the archaeological intuitive coding, I don’t know that the methods used here fitting in the same cluster is a particularly convincing argument in favor of these new methods (and that’s okay). To me, the results you present suggest that people are generally not great at evaluating the relative order of topological patterns and that, in and of itself, argues for some sort of automated approach like you’ve developed here. The clustering diagram and the discussion in the text is a bit confusing, however, as it stands as not every bar in the cluster diagram is labeled and it’s unclear which “secondary clusters” you are referring to in the text. Beyond this, the Figure 8 caption says you are evaluating the four methods but it’s not entirely clear to me which is represented by which label in the existing cluster tree diagram. I think this figure will need to be modified.

The results discussed and shown in Figure 9 are helpful for thinking about how different values for these variables change with variation in balance and disturbance. I think this figure could be improved with a bit more guiding text. The x-axis is labeled Cohesion but Admixture is shown on this same axis with the same 0-1 scale so I would suggest changing that. It would also help to label the rows and columns with balance and disturbance respectively. Also, I think this is an excellent place for you to return to your Table 1 insights and discuss how you might interpret various of these plots in light of that table. Finally, I would suggest just increasing the border for the Admixture color as it isn’t really visible in the plots further to the left.

Figure 10 works and makes the point discussed in the text well. It would be interesting if these were also associated with a non-parametric correlation test like Spearman’s rho just to get a sense of how they differ in terms of rank order correlations. Consider splitting Figure 11 into two figures so that the labels in the boxplot are clearer. The differences in the stabilization of disturbance values for edge count and admixture (edge betweenness) are interesting.

The empirical example from Liang Abu is informative for thinking about how this all would actually work in practice. Notably, the values for cohesion differ in a pretty obvious way but the other three measures (excluding Modularity) show pretty minor differences (and all show 0 for the 0&1 comparison which makes sense). How might you think about these results in relation to your schema in table 1? Is the difference in Cohesion substantial? What would you interpret as a meaningful difference here? Discussing this a bit in section 3.3.1 would be helpful in walking a reader through what you’ve done and provide a justification for the hypothesis testing and simulation you do in the next section.

The hypothesis testing is an important part of the overall argument and I have some questions and need a bit of clarification in a couple of places here. I followed along with your results and also replicated them in R using the code in the supplement. I found the presentation of these results to be fairly brief given that some major interpretations hinge on them. The set up makes sense as your question is basically whether you could consider this context as having one or two layers initially. You use Wilcox test to assess rank order differences in several variables, which seems like a sensible approach and find evidence for differences in the simulated distributions and not others. Given that you have a low but non-null admixture you are really conducting these simulations to assess differences in cohesion in terms of the schema you laid out in table 1. Since scenario 3 and scenario 4 have the same interpretation (and admixture doesn’t differ for either scenario) how do the results you present in this section and Figures 12 and 13 related to the setup you initially provided in table 1? I feel like this needs more discussion. The results for specific variables differ in terms of which hypothesis is more or less supported. The results seem equivocal for several variables and looking at the supplement the although there are statistically significant rank order shifts the distributions are quite similar for some variables suggesting the actual effect is small. You land on an interpretation of two independent layers but I think this needs to be better justified. Given the low admixture value and the setup in table 1, is there a scenario that would have generated different results? I would also be interested in seeing an exploration of the relationship between structural admixture and the absolute difference in cohesion for a range of scenarios.

In general, the results outlined here make sense for this particular example, but I think this empirical example is only able to capture a small range of the possible scenarios that might be encountered in real settings. In general, I would like to see what kind of numbers would generate higher admixture values and how those might compare to what is shown here. In general, the set up in the beginning of the paper outlines the relationship between admixture and cohesion in table 1 but this doesn’t come in to play in the interpretation of the empirical example and I think this makes the final interpretation of the hypothesis test less impactful. I would suggest returning to the table 1 set up in the hypothesis testing section and perhaps even discussing in more detail other realistic simulated datasets that would land you somewhere else on table 1.

Overall, I think this method is potentially useful and most of the implementation was described in an understandable way. I think the major thing that could be improved is the connection between the initial discussion of the method (section 2.2 in particular) and the hypothesis testing and simulation in the end.

In addition to the text and supplement, I also installed and reviewed the R package. I was able to install this but I had to roll back to and earlier (3.6) version of R as one dependency was not available for the most recent version (4.0) of R. If that is an easy fix (looking at the BiocManager packages) it would be helpful to potential future users. As for the supplement, I was not able to fully replicate those results because there are a couple of files that were called in the code that weren’t provided on the OSF link as the markdown document in pdf was the only document posted. I think it would be useful to post the supplement as an Rmd file along with all of the required files to completely replicate the results. Once I was able to get the package installed in R 3.6, I was able to run all of the functions without any problems.

https://doi.org/10.24072/pci.archaeo.100156.rev13