Zahra_Hammook.pdf (4.82 MB)
Download file crawler and a new contribution determines sequence (CDS) method

Download (4.82 MB)
posted on 2022-10-20, 14:44 authored by Hammook Zahra
General and Focus crawlers are the main types of web crawlers used for different goals, with different crawling techniques and architecture. Our crawler was written in Java language using different software and libraries. To test the crawler, it has been run on the academic social network, from 3 rd.April to 28th.June 2014 and retrieved real data. The crawler consists of three main algorithms to crawl information such as researchers details, publications details, questions/answers activity details. The retrieved data has been analyzed to highlight the performance of Canadian researchers, in the field of Computer Science on Data analysis has been done from the collaboration and (alt)metrics perspectives. Among other features came with “Impact Points” and “RG Score” (alt)metrics. The former builds on ISI Journal Impact Factor, which disregards author’s contribution in its calculations. A new Contribution Determines Sequence (CDS) method has been developed and tested, with all required scripts which showed better performance than other methods.



Master of Science


Computer Science

Granting Institution

Ryerson University

LAC Thesis Type


Usage metrics