{"id":25364,"date":"2025-07-25T09:35:27","date_gmt":"2025-07-25T07:35:27","guid":{"rendered":"https:\/\/sano.science\/?post_type=research&#038;p=25364"},"modified":"2025-07-25T09:36:47","modified_gmt":"2025-07-25T07:36:47","slug":"stability-of-machine-learning-predictive-features-under-limited-data-2","status":"publish","type":"research","link":"https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/","title":{"rendered":"Stability of Machine Learning Predictive Features Under Limited Data"},"content":{"rendered":"\n<h2 class=\"wp-block-heading eplus-wrapper\" id=\"h-karol-capala-paulina-tworek-jose-sousa\">Karol Capa\u0142a, Paulina Tworek, Jose Sousa\u00a0<\/h2>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer eplus-wrapper\"><\/div>\n\n\n\n<p class=\"eplus-wrapper wp-block-paragraph\">In many fields\u2014including healthcare and biomedical sciences\u2014machine learning is increasingly used to support critical decision-making. But how reliable are these models when data is scarce or incomplete?<\/p>\n\n\n\n<p class=\"eplus-wrapper wp-block-paragraph\">Autors investigate this issue by examining the stability of predictive features in machine learning models trained on limited datasets. Their study compares conventional ML approaches with a previously introduced method that leverages data abstractions to enhance learning under imperfect conditions.<\/p>\n\n\n\n<p class=\"eplus-wrapper wp-block-paragraph\">The results highlight that the abstraction-based approach not only maintains strong classification performance but also ensures greater consistency in feature selection\u2014even as data availability decreases. This work demonstrates that machine learning systems can be designed to remain interpretable and robust, even in the face of data scarcity, bringing us closer to safe and autonomous AI-based decision-making in complex domains.<\/p>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer eplus-wrapper\"><\/div>\n\n\n\n<p class=\"eplus-wrapper wp-block-paragraph\"><strong>Authors<\/strong>: <a href=\"https:\/\/sano.science\/people\/karol-capala\/\">Karol Capa\u0142a<\/a>, <a href=\"https:\/\/sano.science\/people\/paulina-tworek\/\">Paulina Tworek<\/a>, <a href=\"https:\/\/sano.science\/people\/jose-sousa\/\">Jose Sousa<\/a><\/p>\n\n\n\n<p class=\"eplus-wrapper wp-block-paragraph\"><strong>DOI<\/strong>: <a href=\"https:\/\/doi.org\/10.1109\/TKDE.2025.3580671\" target=\"_blank\" rel=\"noreferrer noopener\">10.1109\/TKDE.2025.3580671<\/a><\/p>\n\n\n\n<p class=\"eplus-wrapper wp-block-paragraph\"><strong>Keywords<\/strong>: feature stability, Classification, data abstractions, limited data,  explainability, machine learning, predictions.<\/p>\n\n\n\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer eplus-wrapper\"><\/div>\n\n\n\n\t\n    \n        \n\t\t\t<a href=\"https:\/\/ieeexplore.ieee.org\/document\/11039685\" target=\"_blank\" rel= \"noopener noreferrer nofollow\" class=\"button primary \">\n\n\t\t\t\t<span>\n\t\t\t\t\tREAD HERE\n\t\t\t\t<\/span>\n\n\t\t\t<\/a>\n\n        \n    \n","protected":false},"excerpt":{"rendered":"<p>Journal paper in: IEEE Transactions on Knowledge and Data Engineering, 2025<\/p>\n","protected":false},"featured_media":0,"template":"","research_type":[8],"research_team":[14],"class_list":["post-25364","research","type-research","status-publish","hentry","research_type-publications","research_team-computational-intelligence"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v28.0 (Yoast SEO v28.0) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Stability of Machine Learning Predictive Features Under Limited Data - Centre for Computational Personalized Medicine<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Stability of Machine Learning Predictive Features Under Limited Data\" \/>\n<meta property=\"og:description\" content=\"Journal paper in: IEEE Transactions on Knowledge and Data Engineering, 2025\" \/>\n<meta property=\"og:url\" content=\"https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/\" \/>\n<meta property=\"og:site_name\" content=\"Centre for Computational Personalized Medicine\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/sano.science\/\" \/>\n<meta property=\"article:modified_time\" content=\"2025-07-25T07:36:47+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:site\" content=\"@sanoscience\" \/>\n<meta name=\"twitter:label1\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data1\" content=\"1 minute\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/sano.science\\\/research\\\/stability-of-machine-learning-predictive-features-under-limited-data-2\\\/\",\"url\":\"https:\\\/\\\/sano.science\\\/research\\\/stability-of-machine-learning-predictive-features-under-limited-data-2\\\/\",\"name\":\"Stability of Machine Learning Predictive Features Under Limited Data - Centre for Computational Personalized Medicine\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/sano.science\\\/#website\"},\"datePublished\":\"2025-07-25T07:35:27+00:00\",\"dateModified\":\"2025-07-25T07:36:47+00:00\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/sano.science\\\/research\\\/stability-of-machine-learning-predictive-features-under-limited-data-2\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/sano.science\\\/research\\\/stability-of-machine-learning-predictive-features-under-limited-data-2\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/sano.science\\\/research\\\/stability-of-machine-learning-predictive-features-under-limited-data-2\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/sano.science\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Research\",\"item\":\"https:\\\/\\\/sano.science\\\/research\\\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Publications\",\"item\":\"https:\\\/\\\/sano.science\\\/research-type\\\/publications\\\/\"},{\"@type\":\"ListItem\",\"position\":4,\"name\":\"Stability of Machine Learning Predictive Features Under Limited Data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/sano.science\\\/#website\",\"url\":\"https:\\\/\\\/sano.science\\\/\",\"name\":\"Centre for Computational Personalized Medicine\",\"description\":\"Sano \u2013 Centre for Computational Medicine\",\"publisher\":{\"@id\":\"https:\\\/\\\/sano.science\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/sano.science\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/sano.science\\\/#organization\",\"name\":\"Sano \u2013 Centre for Computational Medicine\",\"alternateName\":\"Sano\",\"url\":\"https:\\\/\\\/sano.science\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/sano.science\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/sano.science\\\/wp-content\\\/uploads\\\/2024\\\/05\\\/logo_sano_podstawowe.png\",\"contentUrl\":\"https:\\\/\\\/sano.science\\\/wp-content\\\/uploads\\\/2024\\\/05\\\/logo_sano_podstawowe.png\",\"width\":700,\"height\":265,\"caption\":\"Sano \u2013 Centre for Computational Medicine\"},\"image\":{\"@id\":\"https:\\\/\\\/sano.science\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/sano.science\\\/\",\"https:\\\/\\\/x.com\\\/sanoscience\",\"https:\\\/\\\/www.linkedin.com\\\/company\\\/sanoscience\\\/\",\"https:\\\/\\\/www.youtube.com\\\/channel\\\/UCDZ_8TcjMWUG2ZcgKKgfpwQ\",\"https:\\\/\\\/bsky.app\\\/profile\\\/sanoscience.bsky.social\"]}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Stability of Machine Learning Predictive Features Under Limited Data - Centre for Computational Personalized Medicine","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/","og_locale":"en_US","og_type":"article","og_title":"Stability of Machine Learning Predictive Features Under Limited Data","og_description":"Journal paper in: IEEE Transactions on Knowledge and Data Engineering, 2025","og_url":"https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/","og_site_name":"Centre for Computational Personalized Medicine","article_publisher":"https:\/\/www.facebook.com\/sano.science\/","article_modified_time":"2025-07-25T07:36:47+00:00","twitter_card":"summary_large_image","twitter_site":"@sanoscience","twitter_misc":{"Est. reading time":"1 minute"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/","url":"https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/","name":"Stability of Machine Learning Predictive Features Under Limited Data - Centre for Computational Personalized Medicine","isPartOf":{"@id":"https:\/\/sano.science\/#website"},"datePublished":"2025-07-25T07:35:27+00:00","dateModified":"2025-07-25T07:36:47+00:00","breadcrumb":{"@id":"https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/sano.science\/research\/stability-of-machine-learning-predictive-features-under-limited-data-2\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/sano.science\/"},{"@type":"ListItem","position":2,"name":"Research","item":"https:\/\/sano.science\/research\/"},{"@type":"ListItem","position":3,"name":"Publications","item":"https:\/\/sano.science\/research-type\/publications\/"},{"@type":"ListItem","position":4,"name":"Stability of Machine Learning Predictive Features Under Limited Data"}]},{"@type":"WebSite","@id":"https:\/\/sano.science\/#website","url":"https:\/\/sano.science\/","name":"Centre for Computational Personalized Medicine","description":"Sano \u2013 Centre for Computational Medicine","publisher":{"@id":"https:\/\/sano.science\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/sano.science\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/sano.science\/#organization","name":"Sano \u2013 Centre for Computational Medicine","alternateName":"Sano","url":"https:\/\/sano.science\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/sano.science\/#\/schema\/logo\/image\/","url":"https:\/\/sano.science\/wp-content\/uploads\/2024\/05\/logo_sano_podstawowe.png","contentUrl":"https:\/\/sano.science\/wp-content\/uploads\/2024\/05\/logo_sano_podstawowe.png","width":700,"height":265,"caption":"Sano \u2013 Centre for Computational Medicine"},"image":{"@id":"https:\/\/sano.science\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/sano.science\/","https:\/\/x.com\/sanoscience","https:\/\/www.linkedin.com\/company\/sanoscience\/","https:\/\/www.youtube.com\/channel\/UCDZ_8TcjMWUG2ZcgKKgfpwQ","https:\/\/bsky.app\/profile\/sanoscience.bsky.social"]}]}},"acf":[],"gutenberg_blocks":[{"blockName":"custom-styles","attrs":{"styles":""}},{"blockName":"core\/heading","attrs":{"epAnimationGeneratedClass":"edplus_anim-Vx43Fy","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<h2 class=\"wp-block-heading eplus-wrapper\" id=\"h-karol-capala-paulina-tworek-jose-sousa\">Karol Capa\u0142a, Paulina Tworek, Jose Sousa\u00a0<\/h2>\n","innerContent":["\n<h2 class=\"wp-block-heading eplus-wrapper\" id=\"h-karol-capala-paulina-tworek-jose-sousa\">Karol Capa\u0142a, Paulina Tworek, Jose Sousa\u00a0<\/h2>\n"]},{"blockName":"core\/spacer","attrs":{"height":"30px","epAnimationGeneratedClass":"edplus_anim-oGP2hC","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer eplus-wrapper\"><\/div>\n","innerContent":["\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer eplus-wrapper\"><\/div>\n"]},{"blockName":"core\/paragraph","attrs":{"epAnimationGeneratedClass":"edplus_anim-61wN4F","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<p class=\" eplus-wrapper\">In many fields\u2014including healthcare and biomedical sciences\u2014machine learning is increasingly used to support critical decision-making. But how reliable are these models when data is scarce or incomplete?<\/p>\n","innerContent":["\n<p class=\" eplus-wrapper\">In many fields\u2014including healthcare and biomedical sciences\u2014machine learning is increasingly used to support critical decision-making. But how reliable are these models when data is scarce or incomplete?<\/p>\n"]},{"blockName":"core\/paragraph","attrs":{"epAnimationGeneratedClass":"edplus_anim-c9il9v","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<p class=\" eplus-wrapper\">Autors investigate this issue by examining the stability of predictive features in machine learning models trained on limited datasets. Their study compares conventional ML approaches with a previously introduced method that leverages data abstractions to enhance learning under imperfect conditions.<\/p>\n","innerContent":["\n<p class=\" eplus-wrapper\">Autors investigate this issue by examining the stability of predictive features in machine learning models trained on limited datasets. Their study compares conventional ML approaches with a previously introduced method that leverages data abstractions to enhance learning under imperfect conditions.<\/p>\n"]},{"blockName":"core\/paragraph","attrs":{"epAnimationGeneratedClass":"edplus_anim-61wN4F","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<p class=\" eplus-wrapper\">The results highlight that the abstraction-based approach not only maintains strong classification performance but also ensures greater consistency in feature selection\u2014even as data availability decreases. This work demonstrates that machine learning systems can be designed to remain interpretable and robust, even in the face of data scarcity, bringing us closer to safe and autonomous AI-based decision-making in complex domains.<\/p>\n","innerContent":["\n<p class=\" eplus-wrapper\">The results highlight that the abstraction-based approach not only maintains strong classification performance but also ensures greater consistency in feature selection\u2014even as data availability decreases. This work demonstrates that machine learning systems can be designed to remain interpretable and robust, even in the face of data scarcity, bringing us closer to safe and autonomous AI-based decision-making in complex domains.<\/p>\n"]},{"blockName":"core\/spacer","attrs":{"height":"30px","epAnimationGeneratedClass":"edplus_anim-oGP2hC","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer eplus-wrapper\"><\/div>\n","innerContent":["\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer eplus-wrapper\"><\/div>\n"]},{"blockName":"core\/paragraph","attrs":{"epAnimationGeneratedClass":"edplus_anim-7yxA0i","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<p class=\" eplus-wrapper\"><strong>Authors<\/strong>: <a href=\"https:\/\/sano.science\/people\/karol-capala\/\">Karol Capa\u0142a<\/a>, <a href=\"https:\/\/sano.science\/people\/paulina-tworek\/\">Paulina Tworek<\/a>, <a href=\"https:\/\/sano.science\/people\/jose-sousa\/\">Jose Sousa<\/a><\/p>\n","innerContent":["\n<p class=\" eplus-wrapper\"><strong>Authors<\/strong>: <a href=\"https:\/\/sano.science\/people\/karol-capala\/\">Karol Capa\u0142a<\/a>, <a href=\"https:\/\/sano.science\/people\/paulina-tworek\/\">Paulina Tworek<\/a>, <a href=\"https:\/\/sano.science\/people\/jose-sousa\/\">Jose Sousa<\/a><\/p>\n"]},{"blockName":"core\/paragraph","attrs":{"epAnimationGeneratedClass":"edplus_anim-Fum2ss","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<p class=\" eplus-wrapper\"><strong>DOI<\/strong>: <a href=\"https:\/\/doi.org\/10.1109\/TKDE.2025.3580671\" target=\"_blank\" rel=\"noreferrer noopener\">10.1109\/TKDE.2025.3580671<\/a><\/p>\n","innerContent":["\n<p class=\" eplus-wrapper\"><strong>DOI<\/strong>: <a href=\"https:\/\/doi.org\/10.1109\/TKDE.2025.3580671\" target=\"_blank\" rel=\"noreferrer noopener\">10.1109\/TKDE.2025.3580671<\/a><\/p>\n"]},{"blockName":"core\/paragraph","attrs":{"epAnimationGeneratedClass":"edplus_anim-bvpBYN","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<p class=\" eplus-wrapper\"><strong>Keywords<\/strong>: feature stability, Classification, data abstractions, limited data,  explainability, machine learning, predictions.<\/p>\n","innerContent":["\n<p class=\" eplus-wrapper\"><strong>Keywords<\/strong>: feature stability, Classification, data abstractions, limited data,  explainability, machine learning, predictions.<\/p>\n"]},{"blockName":"core\/spacer","attrs":{"height":"30px","epAnimationGeneratedClass":"edplus_anim-oGP2hC","epGeneratedClass":"eplus-wrapper"},"innerBlocks":[],"innerHTML":"\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer eplus-wrapper\"><\/div>\n","innerContent":["\n<div style=\"height:30px\" aria-hidden=\"true\" class=\"wp-block-spacer eplus-wrapper\"><\/div>\n"]},{"blockName":"acf\/button","attrs":{"title":"READ HERE","button_type":"link","url":"https:\/\/ieeexplore.ieee.org\/document\/11039685","button_style":"primary","target":"_blank","button_extra_classes":""},"innerBlocks":[],"innerHTML":"","innerContent":[]}],"meta_data":{"is_automatically_other_posts":true,"number_of_posts":"3","is_automatically_check_also_posts":true},"_links":{"self":[{"href":"https:\/\/sano.science\/index.php\/wp-json\/wp\/v2\/research\/25364","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sano.science\/index.php\/wp-json\/wp\/v2\/research"}],"about":[{"href":"https:\/\/sano.science\/index.php\/wp-json\/wp\/v2\/types\/research"}],"version-history":[{"count":6,"href":"https:\/\/sano.science\/index.php\/wp-json\/wp\/v2\/research\/25364\/revisions"}],"predecessor-version":[{"id":25371,"href":"https:\/\/sano.science\/index.php\/wp-json\/wp\/v2\/research\/25364\/revisions\/25371"}],"wp:attachment":[{"href":"https:\/\/sano.science\/index.php\/wp-json\/wp\/v2\/media?parent=25364"}],"wp:term":[{"taxonomy":"research_type","embeddable":true,"href":"https:\/\/sano.science\/index.php\/wp-json\/wp\/v2\/research_type?post=25364"},{"taxonomy":"research_team","embeddable":true,"href":"https:\/\/sano.science\/index.php\/wp-json\/wp\/v2\/research_team?post=25364"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}