SEBD 2024 Accepted Papers


Doctoral Consortium Papers

  • #7 Understanding Emerging Cyber Attacks and Vulnerabilities Targeting Maritime Systems. Giacomo Longo (University of Genova)*
  • #19 Data/Process Analysis for Advanced Interoperable Cyber Ranges. Giuseppe Salerno (Università della Calabria)*
  • #66 Large Language Models integration in Digital Humanities. Giovanni Sullutrone (University of Modena and Reggio Emilia)*
  • #82 Symbolic Regression for Medical Scoring Systems: a Bayesian and Multi-Objective Approach. Mattia Billa (University of Modena and Reggio Emilia)*
  • #83 A Lakehouse-based platform for Data-driven Sustainability Monitoring in Energy-Intensive Production. Paola Magrino (University of Brescia)*

Papers Accepted for Oral Presentations

  • #1 Emotional Data Querying: The BI Scenario. sandro bimonte (inrae); Patrick Marcel (University of Orléans); Stefano Rizzi (DISI - University of Bologna)*
  • #5 Building Taxonomies with Triplet Queries. Donatella Firmani (Sapienza University)*; Sainyam Galhotra (Cornell University); Barna Saha (UCSD); Divesh Srivastava (AT&T Chief Data Office)
  • #8 A Provenance-Based Caching System to Speed-up SPARQL Query Answering. Gianmaria Silvello (University of Padova)*; Dennis Dosso (SIAV SpA)
  • #9 FDup framework: A General-purpose solution for Efficient Entity Deduplication of Record Collections. Michele De Bonis (Istituto di Scienza e Tecnologie dell’Informazione “A. Faedo” - CNR)*; Claudio Atzori (CNR - ISTI); Sandro La Bruzzo (CNR - ISTI); Paolo Manghi (CNR - ISTI)
  • #10 Bootstrapping Gene Expression-Cancer Knowledge Bases with Limited Human Annotations. Stefano Marchesin (Università di Padova)*; Laura Menotti (University of Padua); Fabio Giachelle (University of Padova); Gianmaria Silvello (University of Padova); Omar Alonso (Amazon)
  • #11 From why-provenance to why+provenance: Towards addressing deep data explanations in Data-Centric AI. Paolo Missier (University of Birmingham)*; Riccardo Torlone (Roma Tre University)
  • #13 Efficient and Effective Multi-Vector Dense Retrieval with EMVB. Franco Maria Nardini (ISTI-CNR, Italy); Cosimo Rulli (ISTI-CNR)*; Rossano Venturini (University of Pisa)
  • #15 Mining Validating Shape for Large Knowledge Graphs via Dynamic Reservoir Sampling. Matteo Lissandrini (University of Verona)*; Kashif Rabbani (Aalborg University Denmark); Katja Hose (TU Wien)
  • #17 Enhancing Next Activity Prediction with Adversarial Training of Vision Transformers. Vincenzo Pasquadibisceglie (University of Bari Aldo Moro)*; Annalisa Appice (University of Bari Aldo Moro); Giovanna Castellano (University of Bari Aldo Moro, Italy); Donato Malerba (Università degli Studi di Bari Aldo Moro)
  • #18 Attacking Maritime Control Systems Through Process Mining. Giacomo Longo (University of Genova); Francesco Lupia (University of Calabria)*; Enrico Russo (University of Genoa); Andrea Pugliese (University of Calabria)
  • #21 A Comparative Assessment of eXplainable AI Tools in Predicting Hard Disk Drive Health. Flora Amato (University of Naples “Federico II”); Antonino Ferraro (University of Naples Federico II); Antonio Galli (Università degli studi di Napoli Federico II)*; Valerio La Gatta (University of Naples Federico II); Francesco Moscato (University of Salerno); Vincenzo Moscato (University of Naples “Federico II”); Carlo Sansone (Universita’ degli Studi di Napoli); Giancarlo Sperlì (University of Naples Federico II)
  • #24 Overlap-Based Duplicate Table Detection. Luca Zecchini (University of Modena and Reggio Emilia)*; Tobias Bleifuß (Hasso Plattner Institute); Giovanni Simonini (University of Modena and Reggio Emilia); Sonia Bergamaschi (Università di Modena e Reggio Emilia); Felix Naumann (Hasso Plattner Institute, University of Potsdam)
  • #26 Data Filtering for a Sustainable Model Training. Francesco Scala (CNR-ICAR and Unical )*; Sergio Flesca; Luigi Pontieri (CNR-ICAR, Italy)
  • #27 The Future of Sustainable Data Preparation. Barbara Pernici (Politecnico di Milano)*; Cinzia Cappiello (Politecnico di Milano); Edoardo Ramalli (Politecnico di Milano, MIT); Matteo Palmonari (University of Milano-Bicocca); Federico Belotti (University of Milano-Bicocca); Flavio De Paoli (University of Milano-Bicocca); Sonia Bergamaschi (Università di Modena e Reggio Emilia); Luca Zecchini (University of Modena and Reggio Emilia); Giovanni Simonini (University of Modena and Reggio Emilia); Angelo Mozzillo (University of Modena and Reggio Emilia); Tiziana Catarci (University of Rome “La Sapienza”); Matteo Filosa (University of Roma “La SApienza”); Marco Angelini (University of Rome “La Sapienza”); Dario Benvenuti (Sapienza, University of Rome)
  • #30 Nowcasting of the energy production of wind power plants through spatially-aware model trees. Annunziata D’Aversa (University of Bari “Aldo Moro”)*; Gianvito Pio (University of Bari)
  • #33 CAMEO: Fostering Joint Conversational Search and Recommendation. Tommaso Di Noia (Politecnico di Bari); Guglielmo Faggioli (University of Padova)*; Marco Ferrante (University of Padova); Nicola Ferro (University of Padova); Fedelucio Narducci (Politecnico di Bari); Raffaele Perego (ISTI-CNR); Giuseppe Santucci (University of Rome “La Sapienza”)
  • #34 Computing the Why-Provenance for Datalog Queries via SAT Solvers. Marco Calautti (University of Milan)*; Ester Livshits (University of Edinburgh); Andreas Pieris (University of Edinburgh); Markus Schneider (University of Edinburgh)
  • #38 Combining Entity Resolution and Query Answering in Ontologies: A Formal Conceptual Framework. Ronald Fagin (IBM Research - Almaden); Phokion Kolaitis; Domenico Lembo (Sapienza University of Rome); Lucian Popa (IBM Almaden Research Center); Federico M Scafoglieri (La Sapienza)*
  • #39 Causal Mediation Analysis for Interpreting Large Language Models. Elisabetta Rocchetti (Università degli Studi di Milano)*; Alfio Ferrara (Università degli Studi di Milano)
  • #40 ASYDE: An Argumentation-based System for classifYing Driving bEhaviors. Bettina Fazzinga (DICES - UNICAL)*; Sergio Flesca; Filippo Furfaro (University of Calabria); Giuseppina Monterosso (Università della Calabria)
  • #41 Evaluating status and value assortativity in Threads. Gianluca Bonifazi (Università Politecnica delle Marche); Enrico Corradini (Polytechnic University of Marche)*; Domenico Ursino (Università Politecnica delle Marche)
  • #43 Speeding up Vision Transformers Through Reinforcement Learning. Francesco Cauteruccio (Università degli Studi di Salerno); Michele Marchetti (Università Politecnica delle Marche); Davide Traini (Università di Modena e Reggio Emilia); Domenico Ursino (Università Politecnica delle Marche); Luca Virgili (Università Politecnica delle Marche)*
  • #48 A Clustering-based Approach for Interpreting Black-box Models. Luca Ferragina (University of Calabria); Simona Nisticò (University of Calabria)*
  • #49 Design of a Telemedicine Infrastructure for Rural and Remote Areas. Prof. Pietro Hiram Guzzi (Univ. Magna Gracia of Catanzaro, IPC, ICIEV)*; Pierangelo Veltri (Unical); Patrizia Vizza (University of Catanzaro); Sergio Greco (Unical); Giuseppe Tradigo (eCampus University)
  • #50 Personalised Exploration Graphs on top of Data Lakes. Devis Bianchini (University of Brescia); Valeria De Antonellis (University of Brescia); Massimiliano Garda (University of Brescia)*
  • #51 Comparing Incomplete Database Instances. Boris Glavic (Illinois Institute of Technology); Giansalvatore Mecca (Universita della Basilicata); Renée J. Miller (Northeastern University); Paolo Papotti (EURECOM); Donatello Santoro (Università della Basilicata); Enzo Veltri (Università della Basilicata)*
  • #52 A Framework for the Generation of Training Examples from Tabular Data. Jean-Flavien Bussotti (Eurecom); Enzo Veltri (Università della Basilicata)*; Donatello Santoro (Università della Basilicata); Paolo Papotti (EURECOM)
  • #53 Initial Achievements in Relation Extraction from RNA-focused Scientific Papers. Emanuele Cavalleri (Università degli Studi di Milano); Mauricio Soto Gomez (University of Milano); Ali Pashaeibarough (University Of Milano); Dario Malchiodi (Università degli Studi di Milano); Harry Caufield (Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA); Justin Reese (Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA); Peter Robinson (Berlin Institute of Health - Charite’, Universitatsmedizin, Berlin, 13353, Germany); Chris J Mungall (Lawrence Berkeley National Laboratory); Elena Casiraghi (University Of Milano); Giorgio Valentini (University Of Milano); Marco Mesiti (University of Milano)*
  • #55 How Transformers Are Revolutionizing Entity Matching. Matteo Paganelli (Hasso Plattner Institute); Donato Tiano (Università degli Studi di Modena e Reggio Emilia); Francesco Del Buono (University of Modena e Reggio Emilia); Andrea Baraldi (Università di Modena e Reggio Emilia); Riccardo Benassi (Università di Modena e Reggio Emilia); Giacomo Guiduzzi (University of Modena and Reggio Emilia); Francesco Guerra (University of Modena e Reggio Emilia)*
  • #58 Clustering Amendments with Semantic Embeddings. Alessandro Sajeva (Università degli Studi Roma Tre)*; Paolo Merialdo (Università degli Studi Roma Tre); Carlo Marchetti (Senato della Repubblica Italiana); Stefano Iannucci (Roma Tre University); Riccardo Torlone (Roma Tre University)
  • #62 An Innovative Big Temporal Data Analytics Technique over Real- Life Healthcare Datasets: The F-TBDA Approach. Alfredo Cuzzocrea (Universitá della Calabria)*; Geertruida H. de Bock (University of Groningen); Willemijn J. Maas (University of Groningen); Selim Soufargi (University of Calabria); Abderraouf Hafsaoui (iDEA Lab, University of Calabria)
  • #64 Assessing Speech Model Performance: A Subgroup Perspective. Alkis Koudounas (Politecnico di Torino)*; Eliana Pastor (Politecnico di Torino); Elena Baralis (Politecnico di Torino)
  • #65 Text-to-SQL with Large Language Models: Exploring the Promise and Pitfalls. Luca Sala (University of Modena and Reggio Emilia)*; Giovanni Sullutrone (University of Modena and Reggio Emilia); Sonia Bergamaschi (Università di Modena e Reggio Emilia)
  • #68 From Product Sheet to Text and Video: A NLG Pipeline to Transform Structured Data into Comprehensive Descriptions. Andrea Avignone (Politecnico di Torino)*; Alessandro Fiori (Politecnico di Torino); Silvia Chiusano (Politecnico di Torino); Giuseppe Rizzo (LINKS Foundation)
  • #69 “Dead or Alive, we can deny it”. A Differentially Private Approach to Survival Analysis.. Francesco L De Faveri (University of Padova)*; Guglielmo Faggioli (University of Padova); Nicola Ferro (University of Padova); Riccardo Spizzo (CRO Aviano)
  • #70 Food Certification through Collaborative Sensory Analysis Methods and Tools. Ada Bagozi (University of Brescia)*; Devis Bianchini (University of Brescia)
  • #81 Verification of Unary Communicating Datalog Programs. C. Aiswarya; Francesco Di Cosmo (Free University of Bozen-Bolzano); Diego Calvanese (Free University of Bozen-Bolzano)*; Marco Montali (Free University of Bozen-Bolzan Italy)
  • #84 Process-level Model Repair through Instance Graphs Representations. Laura Genga (Eindhoven University of Technology); Claudia Diamantini (Università Politecnica delle Marche)*; Emanuele Storti (Università Politecnica delle Marche); Domenico Potena (Universita’ Poltiecnica delle Marche)

Papers accepted for Booster Session and Poster Presentation

  • #2 Towards a Standard for Triggers in Property Graphs. Luigi Bellomarini (Banca d’Italia); Anna Bernasconi (Politecnico di Milano); Stefano Ceri (Politecnico di Milano); Alessia Ms. Gagliardi (Politecnico di Milano); Davide Magnanimi (Banca d’Italia); Davide Martinenghi (Politecnico di Milano)*
  • #3 Colossal Trajectory Mining: Semantic Co-movement Pattern Mining. Chiara Forresi (DISI - University of Bologna); Matteo Francia (DISI - University of Bologna)*; Enrico Gallinucci (DISI - University of Bologna); Matteo Golfarelli (DISI - University of Bologna); Manuele Pasini (Università di Bologna)
  • #4 ReliK: A Reliability Measure for Knowledge Graph Embeddings [Extended Abstract]. Maximilian K Egger (Aarhus University); Wenyue Ma (Aarhus University); Davide Mottin (Aarhus University)*; Panagiotis Karras (University of Copenhagen); Ilaria Bordino (UniCredit R&D); Francesco Gullo (University of L’Aquila); Aris Anagnostopoulos (Sapienza University of Rome)
  • #6 Impact of Data Augmentation on Hate Speech Detection in Roman Urdu. Fariha Maqbool (University of Milano Bicocca); Blerina Spahiu (unimib); Andrea Maurino (Università di Milano Bicocca )*
  • #12 Autonomous Intelligent Systems: From Illusion of Control to Inescapable Delusion. Stephane Grumbach (INRIA); Giorgio Resta (Roma Tre University); Riccardo Torlone (Roma Tre University)*
  • #14 Infantile Predictors of Functional Gastrointestinal Disorders: A Machine Learning Approach to Risk Assessment. Enea Vincenzo Napolitano (University of Naples Federico II)*; Elio Masciari (University of Naples, Italy); Flavia Indrio (University of Salento); Flavia Marchese (University of Foggis); Matteo Rinaldi (Ospedali Riuniti Foggia); Gianfranco Maffei (Ospedali Riuniti Foggia); Isadora Beghetti (University of Bologna); Luigi Corvaglia (University of Bologna); Arianna Aceti (University of Bologna)
  • #16 The ARIADNEplus Knowledge Base. Alessia Bardi (CNR - ISTI)*; Miriam Baglioni (CNR); Andrea Mannocci (CNR - ISTI); Gina Pavone (CNR)
  • #20 Named Entity Recognition using context similarity data augmentation. Ilaria Bartolini (University of Bologna); Angelo Chianese (University of Naples Federico II); Vincenzo Moscato (University of Naples “Federico II”); Marco Postiglione (Federico II); Giancarlo Sperlì (University of Naples Federico II)*; Andrea Vignali (University of Naples Federico II)
  • #22 Mitigating Unfairness in Machine Learning: A Taxonomy and an Evaluation Pipeline. Chiara Criscuolo (Politecnico di Milano)*; Tommaso Dolci (Lero, University of Limerick); Mattia Salnitri (Politecnico di Milano)
  • #28 Machine Learning-Augmented Ontology-Based Data Access for Renewable Energy Data. Marco Calautti (University of Milan)*; Damiano Duranti (University of Trento); Paolo Giorgini (University of Trento)
  • #31 POLARIS: A framework to guide the development of Trustworthy AI systems. Maria Teresa Baldassarre (University of Bari “A. Moro”); Danilo Caivano (University of Bari “A. Moro”); Domenico Gigante (University of Bari “A. Moro”)*; Azzurra Ragone (University of Bari)
  • #35 Refining Triplet Sampling for Improved Self-Supervised Representation Learning. Manuel A Goyo (niversidad Técnica Federico Santa María)*; Giacomo Frisoni (University of Bologna, Italy); Gianluca Moro (DISI - University of Bologna); Claudio Sartori (University of Bologna)
  • #36 Using Graph Neural Networks for Heterogeneous Event Classification. valerio bellandi (Università degli Studi di Milano); Stefano Montanelli (Università degli Studi di Milano); Darya Shlyk (Università degli Studi di Milano)*; Stefano Siccardi (Consorzio Interuniversitario Nazionale per l’Informatica)
  • #37 Automated Knowledge Extraction from Legal Texts using ASKE. Silvana Castano (Università degli Studi di Milano); Alfio Ferrara (Università degli Studi di Milano); Stefano Montanelli (Università degli Studi di Milano); Sergio Picascia (Università degli Studi di Milano)*; Davide Riva (Università degli Studi di Milano)
  • #42 Integrating Brain Networks and Multi-Modal Data for Early Detection of Alzheimer’s Disease. Carmela Comito (ICAR-CNR)*; Clara Pizzuti (CNR-ICAR, Italy); Marcello Sammarra (ICAR-CNR); Annalisa Socievole (ICAR-CNR)
  • #44 Data Pipelines Assessment: The Role of Data Engine Deployment Models. Claudio A. Ardagna (Università degli Studi di Milano)*; valerio bellandi (Università degli Studi di Milano); Marco Luzzara (-); Antongiacomo Polimeno (Università Degli Studi Di Milano)
  • #45 Schema Decomposition via Transformation Patterns. Théo Abgrall (Free University of Bozen-Bolzano)*
  • #46 Privacy-Preserving Data Integration for Health: Adhering to OMOP-CDM Standard. Lisa Trigiante (Università degli studi di Modena e Reggio Emilia)*; DOMENICO BENEVENTANO (DIEF UNIMORE)
  • #47 A Minimum Metadataset for Data Lakes Supporting Healthcare Research. Davide Piantella (Politecnico di Milano)*; Pierluigi Reali (Politecnico di Milano); Priyansh Kumar (Politecnico di Milano); Letizia Tanca (“Politecnico di Milano”)
  • #57 Symbolic Regression for Transparent Clinical Decision Support: A Data-Centric Framework for Scoring System Development. Veronica Guidetti (University of Modena and Reggio Emilia)*; Federica Mandreoli (università di modena e reggio emilia)
  • #60 Biases in Toxicity Detection Models. Gianluca Nogara (SUPSI); Francesco Pierri (Politecnico di Milano)*; Stefano Cresci (IIT-CNR); Luca Luceri (Information Sciences Institute USC); Petter Tornberg (University of Amsterdam); Silvia Giordano (Networking Lab, University of Applied Sciences of Southern Switzerland SUPSI, Switzerland.)
  • #61 Compressing Big OLAP Data Cubes over Mobile Clouds: A Hierarchy-Based Data Partitioning Approach. Alfredo Cuzzocrea (Universitá della Calabria)*
  • #71 Exploring Large Language Models for Procedure Extraction from Documents. Anisa Rula (University of Brescia)*
  • #72 Max Flow Vulnerability of Undirected Planar Networks. Lorenzo Balzotti (Sapienza University of Rome)*; Paolo Franciosa (Sapienza University of Rome)
  • #73 PNRRorienta: A Web Application for Managing Schools, Courses, and Students Involved in the PNRR Orientation Initiative. Andrea Pasin (University of Padua)*; Lorenza Da Re (University of Padua); Andrea Gerosa (University of Padua); Lidia Pezzuoli (University of Padua); Silvia Preciso (University of Padua); Nicola Ferro (University of Padua)
  • #76 Improving Malicious Accounts Discrimination through a New Feature Engineering Approach Using Relaxed Functional Dependencies. Loredana Caruccio (University of Salerno); Gaetano Cimino (University of Salerno); Stefano Cirillo (University of Salerno)*; Domenico Desiato (University of Salerno); Giuseppe Polese (University of Salerno); Genoveffa Tortora (University of Salerno)
  • #79 Data and System Traceability for Transparent AI in Medical Imaging. Sara Colantonio (Institute of Information Science and Technologies of the National Research Council of Italy)*; Andrea Berti (Institute of Information Science and Technologies of the National Research Council of Italy); Gianluca Carloni (Institute of Information Science and Technologies of the National Research Council of Italy ); Claudia Caudai (CNR); Giulio Del Corso (ISTI-CNR); Danila Germanese (Institute of Information Science and Technologies of the National Research Council of Italy); Eva Pachetti (Institute of Information Science and Technologies of the National Research Council of Italy); Maria Antonietta Pascali (Institute of Information Science and Technologies of the National Research Council of Italy); Valia Kalokyri (Foundation of Research and Technology Hellas); Haridimos Kondylakis (FORTH-ICS); Charalampos Kalantzopoulos (Foundation for Research and Technology Hellas); Nikolaos Tachos (Foundation of Research and Technology Hellas); Dimitris Fotiadis (Foundation for Research and Technology Hellas); Valentina Giannini (University of Turin); Simone Mazzetti (Department of Radiology, Candiolo Cancer Institute, ); Daniele Regge (Department of Radiology, Candiolo Cancer Institute, ); Nickolas Papanikolaou (Champalimaud Foundation); Kostas Marias (FORTH); Manolis Tsiknakis (Foundation for Research and Technology Hellas)
  • #85 TypoAlert: a browser extension against typosquatting. Francesco Blefari (Università della Calabria)*; Angelo Furfaro (Università della Calabria); Giovambattista Ianni (Università della Calabria); Alessandro Viscomi (Università della Calabria)
  • #86 Identifying key factors in designing data spaces for Urban Digital Twin Platforms: a data driven approach.* Cristian Martella (University of Salento)*; Angelo Martella (University of Salento); Amro Issam Hamed Attia Ramadan (University of Salento); Antonella Longo (University of Salento)