Kafka-ML: Connecting the data stream with ML/AI frameworks

dc.bibliographicCitation.firstPage15eng
dc.bibliographicCitation.lastPage33eng
dc.bibliographicCitation.volume126eng
dc.contributor.authorMartín, Cristian
dc.contributor.authorLangendoerfer, Peter
dc.contributor.authorZarrin, Pouya Soltani
dc.contributor.authorDíaz, Manuel
dc.contributor.authorRubio, Bartolomé
dc.date.accessioned2022-02-16T06:39:30Z
dc.date.available2022-02-16T06:39:30Z
dc.date.issued2022
dc.description.abstractMachine Learning (ML) and Artificial Intelligence (AI) depend on data sources to train, improve, and make predictions through their algorithms. With the digital revolution and current paradigms like the Internet of Things, this information is turning from static data to continuous data streams. However, most of the ML/AI frameworks used nowadays are not fully prepared for this revolution. In this paper, we propose Kafka-ML, a novel and open-source framework that enables the management of ML/AI pipelines through data streams. Kafka-ML provides an accessible and user-friendly Web user interface where users can easily define ML models, to then train, evaluate, and deploy them for inferences. Kafka-ML itself and the components it deploys are fully managed through containerization technologies, which ensure their portability, easy distribution, and other features such as fault-tolerance and high availability. Finally, a novel approach has been introduced to manage and reuse data streams, which may eliminate the need for data storage or file systems.eng
dc.description.versionpublishedVersioneng
dc.identifier.urihttps://oa.tib.eu/renate/handle/123456789/8019
dc.identifier.urihttps://doi.org/10.34657/7060
dc.language.isoengeng
dc.publisherAmsterdam [u.a.] : Elsevier Scienceeng
dc.relation.doihttps://doi.org/10.1016/j.future.2021.07.037
dc.relation.ispartofseriesFuture generation computer systems 126 (2022)eng
dc.relation.issn0167-739X
dc.rights.licenseCC BY 4.0 Unportedeng
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/eng
dc.subjectApache Kafkaeng
dc.subjectArtificial Intelligenceeng
dc.subjectData streamseng
dc.subjectDistributed systemseng
dc.subjectDockereng
dc.subjectKafka-MLeng
dc.subjectKuberneteseng
dc.subjectMachine Learningeng
dc.subject.ddc004eng
dc.titleKafka-ML: Connecting the data stream with ML/AI frameworkseng
dc.typearticleeng
dc.typeTexteng
dcterms.bibliographicCitation.journalTitleFuture generation computer systemseng
tib.accessRightsopenAccesseng
wgl.contributorIHPeng
wgl.subjectInformatikeng
wgl.typeZeitschriftenartikeleng
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
10.1016_j.future.2021.07.037.pdf
Size:
1.8 MB
Format:
Adobe Portable Document Format
Description:
Collections