Extracting Topics from Open Educational Resources

dc.contributor.authorMolavi, Mohammadreza
dc.contributor.authorTavakoli, Mohammadreza
dc.contributor.authorKismihók, Gábor
dc.date.accessioned2021-05-07T06:54:48Z
dc.date.available2021-05-07T06:54:48Z
dc.date.issued2020
dc.description.abstractIn recent years, Open Educational Resources (OERs) were earmarked as critical when mitigating the increasing need for education globally. Obviously, OERs have high-potential to satisfy learners in many different circumstances, as they are available in a wide range of contexts. However, the low-quality of OER metadata, in general, is one of the main reasons behind the lack of personalised services such as search and recommendation. As a result, the applicability of OERs remains limited. Nevertheless, OER metadata about covered topics (subjects) is essentially required by learners to build effective learning pathways towards their individual learning objectives. Therefore, in this paper, we report on a work in progress project proposing an OER topic extraction approach, applying text mining techniques, to generate high-quality OER metadata about topic distribution. This is done by: 1) collecting 123 lectures from Coursera and Khan Academy in the area of data science related skills, 2) applying Latent Dirichlet Allocation (LDA) on the collected resources in order to extract existing topics related to these skills, and 3) defining topic distributions covered by a particular OER. To evaluate our model, we used the data-set of educational resources from Youtube, and compared our topic distribution results with their manually defined target topics with the help of 3 experts in the area of data science. As a result, our model extracted topics with 79% of F1-score.eng
dc.description.versionacceptedVersioneng
dc.identifier.urihttps://oa.tib.eu/renate/handle/123456789/6170
dc.identifier.urihttps://doi.org/10.34657/5217
dc.language.isoengeng
dc.publisherIthaca, NY : Cornell Universityeng
dc.relation.ispartofseriesarXiv (2020)eng
dc.relation.otherarXiv:2006.11109
dc.rights.licenseCC BY-NC-SA 4.0 Unportedeng
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/4.0/eng
dc.subjectOpen Educational Resourceeng
dc.subjectOEReng
dc.subjectTopic Extractioneng
dc.subjectText Miningeng
dc.subjectMachine Learningeng
dc.subject.ddc004eng
dc.titleExtracting Topics from Open Educational Resourceseng
dc.typebookParteng
dc.typeTexteng
dcterms.bibliographicCitation.journalTitlearXiveng
tib.accessRightsopenAccesseng
wgl.contributorTIBeng
wgl.subjectErziehung, Schul- und Bildungsweseneng
wgl.typeBuchkapitel / Sammelwerksbeitrageng
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Molavi2020, Extracting Topics from Open Educational Resources.pdf
Size:
333.48 KB
Format:
Adobe Portable Document Format
Description: