About: In data analytics, missing data is a factor that degrades performance. Incorrect imputation of missing values can lead to wrong predictions. In the era of big data, when a massive volume of data is generated every second and the utilization of these data is a major concern to stakeholders, handling missing values efficiently becomes more important. In this paper, we propose a new technique for missing data imputation, which is a hybrid of single and multiple imputation techniques. We propose an extension of the popular Multivariate Imputation by Chained Equations (MICE) algorithm in two variations, to impute categorical and numeric data. We also implement twelve existing algorithms to impute binary, ordinal, and numeric missing values. We collected sixty-five thousand real health records from different hospitals and diagnostic centers in Bangladesh, maintaining the privacy of the data. We also collected three public datasets from the UCI Machine Learning Repository, ETH Zurich, and Kaggle. We compared the performance of our proposed algorithms with existing algorithms using these datasets. Experimental results show that our proposed algorithm achieves a 20% higher F-measure for binary data imputation and 11% less error for numeric data imputation than its competitors, with similar execution time.
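
For readers unfamiliar with chained-equation imputation, the following is a minimal sketch of the general idea behind MICE-style methods: each incomplete column is regressed on the other columns, and the fitted model refills that column's missing cells, cycling until the values settle. It is not the algorithm proposed in the paper, and it is a simplified, deterministic variant: full MICE draws imputations stochastically and produces several completed datasets. The function names (`solve_linear`, `fit_ols`, `chained_impute`), the tiny ridge term, and the example data are all assumptions made for illustration.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def fit_ols(xs, ys, ridge=1e-8):
    """Least-squares fit with intercept via the normal equations.

    The small ridge term (an assumption of this sketch) keeps the system
    solvable when predictor columns are collinear.
    """
    rows = [[1.0] + list(x) for x in xs]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) + (ridge if i == j else 0.0)
            for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * y for r, y in zip(rows, ys)) for i in range(k)]
    return solve_linear(xtx, xty)


def chained_impute(data, n_iter=10):
    """Fill None cells in a numeric table by cycling per-column regressions.

    Assumes every column has at least one observed value.
    """
    n_rows, n_cols = len(data), len(data[0])
    missing = [[v is None for v in row] for row in data]
    filled = [row[:] for row in data]

    # Step 1: start from a single-imputation guess (the column mean).
    for j in range(n_cols):
        observed = [data[i][j] for i in range(n_rows) if not missing[i][j]]
        mean = sum(observed) / len(observed)
        for i in range(n_rows):
            if missing[i][j]:
                filled[i][j] = mean

    # Step 2: repeatedly re-predict each incomplete column from the others.
    for _ in range(n_iter):
        for j in range(n_cols):
            if not any(missing[i][j] for i in range(n_rows)):
                continue
            others = [c for c in range(n_cols) if c != j]
            train = [i for i in range(n_rows) if not missing[i][j]]
            beta = fit_ols([[filled[i][c] for c in others] for i in train],
                           [filled[i][j] for i in train])
            for i in range(n_rows):
                if missing[i][j]:
                    feats = [filled[i][c] for c in others]
                    filled[i][j] = beta[0] + sum(
                        b * f for b, f in zip(beta[1:], feats))
    return filled


# Hypothetical example: the second column follows y = 2x + 1, one cell missing.
example = [[1.0, 3.0], [2.0, 5.0], [3.0, None], [4.0, 9.0], [5.0, 11.0]]
completed = chained_impute(example)
```

In this example the missing cell is filled with a value close to 7, because the observed rows lie exactly on y = 2x + 1. Observed cells are never modified; only cells that were `None` in the input are rewritten.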


An Entity of Type: fabio:Abstract, within Data Space: wasabi.inria.fr, associated with source document(s)

Attributes	Values
type	abstract
value	In data analytics, missing data is a factor that degrades performance. Incorrect imputation of missing values can lead to wrong predictions. In the era of big data, when a massive volume of data is generated every second and the utilization of these data is a major concern to stakeholders, handling missing values efficiently becomes more important. In this paper, we propose a new technique for missing data imputation, which is a hybrid of single and multiple imputation techniques. We propose an extension of the popular Multivariate Imputation by Chained Equations (MICE) algorithm in two variations, to impute categorical and numeric data. We also implement twelve existing algorithms to impute binary, ordinal, and numeric missing values. We collected sixty-five thousand real health records from different hospitals and diagnostic centers in Bangladesh, maintaining the privacy of the data. We also collected three public datasets from the UCI Machine Learning Repository, ETH Zurich, and Kaggle. We compared the performance of our proposed algorithms with existing algorithms using these datasets. Experimental results show that our proposed algorithm achieves a 20% higher F-measure for binary data imputation and 11% less error for numeric data imputation than its competitors, with similar execution time.
subject	Distributed computing problems; Scientific method; Technology forecasting; Kamen Rider television series
part of	SICE: an improved missing data imputation technique
is abstract of	SICE: an improved missing data imputation technique
is hasSource of	covid:ann/target/04a3fb06277a6645dfc6444717995b7b4b81aa20 covid:ann/target/4600423113caa4a841f46e3172fca20c26c78b0b covid:ann/target/1db259f6cd965639af549e07f62c367cc665bd50 covid:ann/target/efa48ee1952f094c3659f50ee0096eb965125781 covid:ann/target/18043c325b0bf0fc32c768bd1760fac57ec08942 covid:ann/target/23a1a443b974ec0937d9f0fc1d7607031c9f2005 covid:ann/target/3e82d97a13030e901ca7bb2a0af6ec5f614ab3f6 covid:ann/target/5e69cbdef9aae42fe6e79ba3767b4efec68a1354 covid:ann/target/aa9aebbaca2156a694cfc2447f6bab4e35c0bfe0 covid:ann/target/4b586fd636192f2908ad60e67fdea44ae055c384 covid:ann/target/4e9a49e584e8cd81f2b0dfeb5ee2066571067363 covid:ann/target/6c2feb735081fcabfbaf7f0daca8d8ef9f980c7d covid:ann/target/f6d14ee598b8be244e4d656eb20358484acf07a5 covid:ann/target/8ec3403c4b39d87bf15a59148e80c3c2296cde6d covid:ann/target/8ec938f6e85486b3478f57f13cc65d4b343d241c covid:ann/target/c984cc3aa8cccd13621098b59d46c4678814c48b covid:ann/target/744f7d2eae65d422ec0dcbab66bbb6d59f4bd5aa covid:ann/target/3913ac9a53b845ce36b15630c2d70c7525e164d9 covid:ann/target/18a92dc9aa61d09291a12f3fa92ab2fa714cc870 covid:ann/target/c336c92f81a8c91b5a19f70bca3a9c207a9dc259 covid:ann/target/8c3d4a90bb73fef31f60b6c0525210ce5b8d2855 covid:ann/target/471b242f790bc4d5efaded0bcb71a9b310e15be4 covid:ann/target/5cfd05e33418bc7b3a64048b7846306e80f54d47 covid:ann/target/b4c4aa26cc3d6007daf90faf2907bd0fad6ead2f covid:ann/target/039ec570ef2dc8663c35b80dca742df1666ff4c1 covid:ann/target/fc5d1e023250cec3f7d55f769a5c1577be81be29 covid:ann/target/d541d3091be10db0c1a57dbf77d85178e4833490 covid:ann/target/1a0cf7fd8a40ae0d995a57c910b6b2d89c5ce43a covid:ann/target/124d40b10ba53e4f698782155602f2c76a200fe1 covid:ann/target/72b670f9fb0b516b4d99af0ff142bdcc87d35204 covid:ann/target/962575729dbeafaf2b94fa56612779022eae01e6 covid:ann/target/0b8b074059b59615baaac8aa6fa8629e0a8a8f52 covid:ann/target/5963856d1cc67d875906998785129d4f356a1629 covid:ann/target/810b6c961fcf51bba792f333fd3714c89e34957e 
covid:ann/target/7efc0775394268066d13e51f9cf44e5ce3b915ff covid:ann/target/9f092907c58ee518305a3fbb4f21eea55482529d covid:ann/target/e9203e081ce39888068a01bb901acd567b58a87e covid:ann/target/840046eeba78b4a65fe3b53d8052ac343fa5e832 covid:ann/target/1ce7a1de70d5f496a596777e2b6fa4e1e9f587f9 covid:ann/target/014c13e8f5e859264c39b2f77b3dcb59881a5009 covid:ann/target/2ca233fb565589e4e563b9219d9436949c0a9c06 covid:ann/target/d269483b86ac82ca32074a63224c6b798191ce55 covid:ann/target/b8bf493544c228fdece44773c3810bbb9e392c56 covid:ann/target/4ed888ae250bc2dab5362ac24c13ba4188a693e6 covid:ann/target/4fdf51521364234a52f7b933a5bfde03dcae1b67 covid:ann/target/f8b6924b8b80a961af14b7cfb8cb03afb64e8167 covid:ann/target/69b04d063e3f0b5bf6571faaf4aef2e74a6bc92b covid:ann/target/6e5b0c39005b502c19a394c6b7f04b94f431f092 covid:ann/target/7789d3fec8aa1a8b5aca53e44fbb7a7b0d6a27c8 covid:ann/target/7aaf24157c61b6f6f7f5e685dc63ece856592ef1 covid:ann/target/9fb111808bfb6b48bda816387ca397f4508bdf4a covid:ann/target/3d40e1273b6cd79cd69160eff7975a42d18884a0 covid:ann/target/7a63a5ef2ba05de649d560a3a0761ac37850e2ca covid:ann/target/d52a1821d4ee8eb0246a395c49ea6422ef68573b covid:ann/target/4ae9276e00041d02da1c53bd959471bae54b1025 covid:ann/target/ad45ed3173ca0480f7c7719dd18a087d567fa4fd covid:ann/target/e4bb9f9238410a0e8f95aaed75a6b1cbd4c84d76 covid:ann/target/a33b9b447b277bbdf6e973cf247ac9e6cab3499a covid:ann/target/d26f3a3006f32d0894a8485821ea81689dfa1a55 covid:ann/target/b5f4da027ccc22330bbcb0d5fe3a6afbf9329a6b covid:ann/target/42f308dda0c6b73b0db87bd7ff3b1d7c07f795c2 covid:ann/target/3be27d5d867c93cd124a3e6a5010f438ef6d35df covid:ann/target/54eefecb29cca2a3fe5d6513b19c0d3747ec847f covid:ann/target/fbb85f1dd886102b3f00af2c6dc6218148c279e9 covid:ann/target/d869002beceeb3ace652a5a6554e4e1498cfc511 covid:ann/target/bdf8971dd66b3324bf3f1818fc9a76441a21384d covid:ann/target/82dfc71edb1a23b1948346d35e363fd7247450b0 covid:ann/target/3d848b8401d3cd183848c29a563dd8c282eaa1e9 
covid:ann/target/92d0e6e82ef932e748271c1798bd7657faf35d51 covid:ann/target/e8ca75b7ca4273326fb405d837b30ff8122f4ed3 covid:ann/target/c3be273cd1c0daa842c9b0444d876bdf60e9e6e7 covid:ann/target/6446c77fddc9869d7e9a197046ba850635838f0e covid:ann/target/66c0980951e9aa944e8ec0eda3c81da03bbf6a6a covid:ann/target/87cc8c74a0d738fe40b654a0f0d76ef54df333c4 covid:ann/target/acdf903011e838f598a27497e8ed46a0ef9757cd covid:ann/target/b37d2c54bd1b34cd64a03f8855e7333f0f160b01 covid:ann/target/53479c72b1faa1a67fdf1317a0b53da7b35164cd covid:ann/target/f1588c7c851d691422d99267cc83fba93b4fc505 covid:ann/target/c045cd7e792189291ed113be773efe6ffb3ae0ed covid:ann/target/01bcb0f9d8629d4f5c57a6fc169db34d7c478d06 covid:ann/target/74aa46331b16a22605782be68f52ec65efc01411 covid:ann/target/a6f02c9ac432a2d240bf0f6a3a5994af51611635 covid:ann/target/9933e963fe1d20cfd19a125f98217f405f5e1616 covid:ann/target/7f99fd3b6fec66762449c0fa6dc867a520d2c481 covid:ann/target/117b139000c9f8f849591417e58456e767bcc5c2 covid:ann/target/c390db122812f7a73bbc84722df897f39618660e covid:ann/target/1da7498a1917ad79d233c6581ffbe5b04ae44e36 covid:ann/target/c3a6d3f647e147e1437564a97d191cfcd7230532 covid:ann/target/b0705fd2faa66c558289397d7a7c8e7702b88464 covid:ann/target/c5cf4a68abdebb646d5ce30b60738e23dd96611d covid:ann/target/491f2deab5b1df951b3fefe46bd4de7af56c6a42 covid:ann/target/cb79fa6e8879a2c7aafadc7d6bb0cddc85e8d644
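
The abstract reports results as an F-measure for binary imputation and an error for numeric imputation. As a rough illustration of how such scores can be computed over the cells that were imputed, here is a minimal sketch. The abstract does not say which numeric error measure was used, so root-mean-square error is an assumption here, and the function names `f_measure` and `rmse` and the sample values are hypothetical.

```python
import math


def f_measure(true_labels, imputed_labels, positive=1):
    """F-measure (harmonic mean of precision and recall) for binary labels."""
    pairs = list(zip(true_labels, imputed_labels))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def rmse(true_values, imputed_values):
    """Root-mean-square error between true and imputed numeric values."""
    squared = [(t - p) ** 2 for t, p in zip(true_values, imputed_values)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical held-out cells: true values versus what an imputer produced.
binary_score = f_measure([1, 0, 1, 1], [1, 1, 0, 1])
numeric_error = rmse([1.0, 2.0, 3.0], [1.0, 2.0, 5.0])
```

A common evaluation setup is to hide a subset of known cells, impute them, and score only those cells, so that the true values are available for comparison.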
