Publication:
Implementation of Token Parsing Technique for Regex Based Classification of Unstructured Data for Cyber Threat Analysis

dc.citedby: 1
dc.contributor.author: Mohd Pakhari M.H. [en_US]
dc.contributor.author: Jamil N. [en_US]
dc.contributor.author: Rusli M.E. [en_US]
dc.contributor.author: Abdul Rahim A.A. [en_US]
dc.contributor.authorid: 57220805194 [en_US]
dc.contributor.authorid: 36682671900 [en_US]
dc.contributor.authorid: 16246214600 [en_US]
dc.contributor.authorid: 57220806943 [en_US]
dc.date.accessioned: 2023-05-29T08:08:19Z
dc.date.available: 2023-05-29T08:08:19Z
dc.date.issued: 2020
dc.description: Data handling; Engines; Information use; Pattern matching; Cyber threats; Public resources; Structured data; Threat analysis; Unstructured data; Classification (of information) [en_US]
dc.description.abstract: Cyber Threat Intelligence (CTI) is information about cyber threats that has been analysed, structured, and refined. This information helps organizations understand the current risks, at their different levels, that might harm their enterprises. CTI can also help organizations plan defensive countermeasures and protect themselves from damaging attacks. In this paper, we introduce a token parsing technique for regex-based classification of unstructured data for a cyber threat analytic (CTA) engine that performs threat analysis on data crawled from several public resources. Our engine crawls and fetches data from the public resources as a time series, analyses the data, and provides meaningful information to the user along with a timeline of the fetched parameter. The collected data, which is unstructured, is converted by the engine into structured data and then inserted into the database. The engine then analyses the threat data by modelling it before returning useful information to the user. The challenge is to obtain structured data that is useful for analysis. This paper explains how our token parsing technique is useful in regex-based classification to convert unstructured data into useful structured data. © 2020 IEEE. [en_US]
dc.description.nature: Final [en_US]
dc.identifier.ArtNo: 9243415
dc.identifier.doi: 10.1109/ICIMU49871.2020.9243415
dc.identifier.epage: 398
dc.identifier.scopus: 2-s2.0-85097642842
dc.identifier.spage: 395
dc.identifier.uri: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85097642842&doi=10.1109%2fICIMU49871.2020.9243415&partnerID=40&md5=72dc48523a5d04414d202f37cb40d776
dc.identifier.uri: https://irepository.uniten.edu.my/handle/123456789/25340
dc.publisher: Institute of Electrical and Electronics Engineers Inc. [en_US]
dc.source: Scopus
dc.sourcetitle: 2020 8th International Conference on Information Technology and Multimedia, ICIMU 2020
dc.title: Implementation of Token Parsing Technique for Regex Based Classification of Unstructured Data for Cyber Threat Analysis [en_US]
dc.type: Conference Paper [en_US]
dspace.entity.type: Publication