The data needed to complete the GIS-based model is found in table 10 below. Each dataset may have been used multiple times or with different specifications depending on broadband type. This table summarizes the data as it was collected, and from what source.
Some of these data layers need to be preprocessed before being used within the final analysis. To start, the Cell Tower Nodes dataset was broken down into small portions, as the number of points made further processing steps difficult. The Cartographic Boundaries and Population Dwelling and Counts data sets were joined together to form a single layer to properly weigh them as a single factor. Finally, the CRTC Internet Coverage data was originally in a KML format, which had to be converted into a shapefile before being worked on.
There are a few limitations that need to be addressed. Firstly, the data source for Installation Costs has a wide range of entry years, some being more recent than others. Because of this, it will be assumed that the 2014 adjusted cost per unit displayed in the data source is accurate enough for this analysis. Second, the Inhabited Land data set excludes all areas where the population is below 1000 and the density is less than 0.4 people per square kilometre. As such, all of these areas will be considered isolated and will be listed as areas of need for simplicity. Finally, one of the data sources is open source, which can potentially be incorrect or not up to date. The assumption for this analysis is that the data source is accurate enough since most of the results will not rely on high precision.