KMi Publications

Tech Reports

Tech Report kmi-96-08 Abstract


Robust Learning with Missing Data
Techreport ID: kmi-96-08
Date: 1996
Author(s): Marco Ramoni and Paola Sebastiani
Download Postscript

Bayesian methods are becoming increasingly popular in the development of intelligent machines. Bayesian Belief Networks (BBNs) are nowaday a prominent reasoning method and, during the past few years, several efforts have been addressed to develop methods able to learn BBNs directly from databases. However, all these methods assume that the database is complete or, at least, that unreported data are missing at random. Unfortunately, real-world databases are rarely complete and the "Missing at Random" assumption is often unrealistic. This paper shows that this assumption can dramatically affect the reliability of the learned BBN and introduces a robust method to learn conditional probabilities in a BBN, which does not rely on this assumption. In order to drop this assumption, we have to change the overall learning strategy used by traditional Bayesian methods: our method bounds the set of all posterior probabilities consistent with the database and proceed by refining this set as more information becomes available. An experimental comparison - using both an artificial example and a real medical application - of our method with a powerful stochastic simulator will show a dramatic gain in robustness and the computational advantages of our deterministic method. 1. Knowledge Media Institute, The Open University. 2. Department of Actuarial Science and Statistics, City University.
 
KMi Publications Event | SSSW 2013, The 10th Summer School on Ontology Engineering and the Semantic Web Journal | 25 years of knowledge acquisition
 

Future Internet is...


Future Internet
With over a billion users, today's Internet is arguably the most successful human artifact ever created. The Internet's physical infrastructure, software, and content now play an integral part of the lives of everyone on the planet, whether they interact with it directly or not. Now nearing its fifth decade, the Internet has shown remarkable resilience and flexibility in the face of ever increasing numbers of users, data volume, and changing usage patterns, but faces growing challenges in meetings the needs of our knowledge society. Globally, many major initiatives are underway to address the need for more scientific research, physical infrastructure investment, better education, and better utilisation of the Internet. Within Japan, USA and Europe major new initiatives have begun in the area.

To succeed the Future Internet will need to address a number of cross-cutting challenges including:

  • Scalability in the face of peer-to-peer traffic, decentralisation, and increased openness

  • Trust when government, medical, financial, personal data are increasingly trusted to the cloud, and middleware will increasingly use dynamic service selection

  • Interoperability of semantic data and metadata, and of services which will be dynamically orchestrated

  • Pervasive usability for users of mobile devices, different languages, cultures and physical abilities

  • Mobility for users who expect a seamless experience across spaces, devices, and velocities