Evidence based alternative method of estimating Internet Users in the ...

0 downloads 188 Views 123KB Size Report
While this was based on telephone penetration, its relevance ... subscriptions with use of data communications at broadb
Evidence based alternative method of estimating Internet Users in the absence of survey data: using Sri Lanka as a case study Roshanthi Lucas Gunaratne & Rohan Samarajiva LIRNEasia ABSTRACT The present paper seeks to improve the measurement of the ICT indicator ‘proportion of individuals using the Internet’. It is a base indicator that is used in many composite indices. There are significant shortcomings in the current method of estimating the number of Internet users in countries where demand-side data is unavailable. Governments calculate the number of Internet users based on the number of subscriptions, using a multiplier, which leads to unrealistic values. This paper explores the possibility of using income and education components of the Human Development Index (HDI) to define a new index, and using it to obtain a more accurate estimate of the Internet penetration rate. A regression analysis, between the Internet penetration rate and this new index yields a direct correlation between the two. Using this data, a model was derived which enables the estimation of Internet penetration rate given the income and education level. It is proposed that this evidence based estimation of Internet penetration rate be used in the absence of demand side surveys instead of arbitrary multipliers provided by country administrations. The paper then tests this new methodology on Sri Lanka to estimate the number of Internet users and compares it with different estimates from ITU, TRCSL and the findings from the 2009 Household ICT Literacy survey.

INTRODUCTION With the growth of information and communication technologies (ICTs), and their enormous contribution to economic and social progress of countries, there is an increasing demand for accurate measurement of ICT access and use (Calderaro, 2009, ITU, 2011b). Within countries, the performance of ICT indicators is used to formulate ICT policies. Internationally they are used to measure the digital divide, to monitor progress towards the World Summit on the Information Society (WSIS)1 targets and Millennium Development Goals (MDGs)2. Within the many indicators proposed by the International Telecommunication Union (ITU) to measure the development of ICTs (Partnership on measuring ICT for Development, 2010) the present paper places the greatest emphasis on ‘proportion of individuals using the Internet’. This is partly due to the challenges of arriving at accurate and realistic estimates in the absence of up-to-date, representative survey data from countries (Beilock & Dimitrova, 2003, Donner & Toyama 2009, Donat, Brandtweiner & Kerschbaum, 2009, ITU, 2010b). It is also a base indicator that is used in many composite indices such as the ITU’s ICT Development Index (IDI), the World Economic Forum’s Network Readiness Index and the World Bank’s Knowledge Economy Index (KEI) (Dutta & Bilbao-Osori, 2012, World Bank 2008, ITU 2011b, Win Tun, 2010). It is obviously an important, and indeed indispensable, indicator when all eyes are on the emergence of an Internet Economy. Errors in such base indicators ripple through the system, sometimes diluted and sometimes accentuated. Therefore, it is imperative that best efforts be made to ensure that errors are minimized. This paper focuses on developing a new evidence-based methodology to estimate the Internet penetration rate in the absence of demand side surveys. The paper also cautions about the longevity of the present supply side indicator definitions in the face of convergence, much talked about for years and finally happening, and emphasizes the need to create 1

WSIS was held to discuss the use of Information and Communication Technologies (ICT) for development. At the conclusion, governments agreed to strive to reach ten targets related to ICTs by the year 2015 (ITU, 2011c)

2

The Millennium Development Goals (MDGs) are eight international development goals that all United Nations member states have agreed to achieve by the year 2015 (UN, 2010)

incentives for greater reliance on demand-side surveys.

Importance of Internet Penetration Rate Indicator One of the main reasons of focusing on the Internet penetration rate indicator is that it forms the base of many other indices used to measure ICT development. The most common and internationally recognized indices used to compare the ICT sector performance between countries are NRI (Networked Readiness Index), Digital Economy Index (previously e-readiness Index), KEI (Knowledge Economy Index) and IDI (ICT Development Index). Proportion of individuals using the Internet is a key sub-indicator of each of these indices (Dutta & Bilbao-Osorio 2012, The Economist Intelligence Unit, 2010, World Bank 2008, ITU 2011b). In addition to being a base indicator for each of the above indices which are used to assess countries’ ICT policies, Internet user penetration is also one of the key indicators used to assess the achievement of target eight (developing a global partnership for development) of the Millennium Development Goals (UN, 2010). It is also a key indicator in measuring the World Summit on the Information Society (WSIS) Target 10 which seeks to “Ensure that more than half the world’s inhabitants have access to ICTs within their reach and make use of them” (ITU, 2011c).

Factors that affect Internet penetration As early as 1963, Jipp illustrated that there is a strong correlation between teledensity and economic development in a country (Jipp, 1963). While this was based on telephone penetration, its relevance can be extended to Internet penetration today. In addition to income, an Internet user also must be literate in order to be able to make the maximum use of the Internet (Chaudhuri, Flamm & Horrigan, 2005, Hilbert 2012). Hilbert & Peres had conducted a multivariate discriminative analysis of ten attributes (including Education, Income, Household Size, Age, Gender and Ethnicity) in South American countries, testing for household Internet access penetration (Hilbert & Peres 2010). They came to the conclusion that out of the variables tested the main two factors driving Internet penetration are income and education. In addition, according to the ITU survey data (ITU 2011b), countries with high income are the ones with the highest Internet and computer penetration. The ITU survey data also show that there is a correlation between education level and Internet penetration rate (ITU 2011b).

Problems with the current method of estimating Internet users The most reliable way of measuring Internet users is through demand-side or household surveys (ITU 2011a). The Core ICT Indicators 2010 published by the Partnership on Measuring ICT for Development captures this through indicator HH7 (Proportion of individuals who used the Internet (from any location) in the last 12 months), among others. However, demand-side surveys are costly and difficult to organize. Many countries do not conduct regular surveys on ICTs. According to ITU only 37% of countries have conducted demand side surveys (ITU 2011b). The current method for measuring Internet users in the absence of survey data is to derive an estimate based on the number of total Internet subscriptions, using a multiplier to account for people who use Public Internet Access Points (PIAPs) or people who use the Internet at their work places, schools or other public locations. The latest definition by the ITU for the Estimated Internet Users (indicator 4212, in ITU 2010a) states that “In situations where surveys are not available, an estimate can be derived based on the number of Internet Subscriptions”. Theoretically, the Number of Internet Subscriptions should be the sum of Internet subscriptions of all types/technologies and all speeds including both fixed (wired) and mobile (wireless) Internet subscriptions. However, this method of adding subscriptions could lead to significant over-counting. Furthermore, mobile broadband subscriptions are measured by the indicator “Standard mobile subscriptions with use of data communications at broadband speeds” (indicator 271mb_use, as per ITU 2010a). This measures subscriptions with potential access rather than actual active subscriptions. In the past few years, OECD countries have started to report active mobile broadband subscriptions. 2

While some countries over-report, in other countries this indicator is under-reported due to use of mobile data from post-paid connections. For example in Sri Lanka, all SIMs provided by a major operator are data enabled. Therefore, even without a specific data plan, any customer with a datacompatible mobile phone can use the Internet, but they are not all counted by the operator and thus not reported. As a result, data have become incomparable across countries, with some countries reporting potential access, others active use and yet others none at all. Therefore ITU is currently in the process of trying to harmonize these different types of data and has requested countries to report only active mobile broadband connections (ITU, 2011b). In addition to the issues of estimating the total number of subscriptions, ITU has allowed national administrations to use different multipliers at their discretion in order to estimate the number of users from the number of subscriptions. Due to the use of arbitrary multipliers, comparison between countries is not possible. However ITU is trying to discourage this practice. In 2010 ITU stopped collecting this indicator and is rather collecting the proportion of Internet users (from the total population). This intends to send the message to countries that the proportion should come from sampling surveys or censuses when available. Even if the indicator is expressed in proportion, many countries are still using the multiplier from the number of subscriptions in order to calculate it. Therefore, naturally, a question may be raised on the possibility of larger multipliers being used to show a higher number of Internet users in a country. Due to the importance of this indicator and these issues a different methodology to estimate the Internet penetration of a country is proposed in this study.

AN ALTERNATIVE METHOD OF ESTIMATING INTERNET PENETRATION RATE Due to the flaws in both calculation of total subscriptions and the arbitrariness of the multiplier a completely different method of estimating the number of Internet users for countries which have not conducted demand side surveys is proposed in this study. It is proposed that be used to estimate the Internet penetration based on the Income and Education components of the Human Development Index (HDI) when survey data is unavailable.

Methodology In section 1.2 it is observed that income and education appear to be the main factors influencing Internet penetration in a country. Therefore the direct correlation between Internet penetration and income level as well as education level was explored. In order to test the hypothesis a regression analysis was carried out on the actual Internet penetration rates (for countries where demand side surveys on Internet use has been conducted) and the Index created only using the Education and Income components of the Human Development Index (HDI). This new index will be defined as HDI_EdGNI in the rest of the paper. HDI was developed by UNDP by combining indicators of life expectancy, educational attainment and income into a composite human development index. The new index (HDI_EdGNI) was created using only the education and income components of HDI since there is no evidence that Internet penetration is correlated with life expectancy. In HDI_EdGNI, income and education have been given equal weightage. The education component of the HDI is measured by mean of years of schooling for adults aged 25 years and expected years of schooling for children of school going age. The income component is measured by GNI per capita (PPP$). The HDI uses the logarithm of income, to reflect the diminishing importance of income with increasing GNI (UNDP, 2011). Figure 1 shows the correlation between HDI_EdGNI Index and the Internet penetration. The outlier Morocco was removed from the dataset to improve the fit of the model as it had high Internet penetration, but low GNI and education level. Regression analysis gives the best fit for the correlation, with adjusted R squared value of 8.44, as in equation 1. 3

Figure 1: Internet penetration rate correlated with education and income components of HDI Index of countries for which demand-side survey results are available Source: Authors

y = 0.4 × e 6 x

(1)

where x = new index HDI_EdGNI y = proportion of Internet users Survey Data (2008 - 2010) was obtained from ITU Market Information and Statistics division. Since there is a lack of African survey data in the ITU set, it has been complemented by data from household surveys conducted by Research ICT Africa (RIA) in 17 African countries at the end of 2007 and beginning of 2008 (Gillwald & Stork 2008). HDI data is from 2010 as over 60% of the survey data was from 2010. Since surveys will only be conducted once every 3 or 4 years, in between surveys, the increase can be estimated assuming steady growth based on historical survey data. If it is the first demand side survey on Internet use for the country, the growth rate of a similar country in the region with similar GNI and Education level can be used as a proxy. As this data was not available at hand when writing this report the model has been developed using existing data. The model can and should evolve with time, especially as people obtain more Internet connections in their homes, and more countries conduct demand-side surveys, the formula will become more accurate. Therefore, it is necessary to re-analyse the existing demand-side Internet user data with education and income components of HDI and creating a new model to estimate the proportion of Internet users annually. This can be done soon after the UNDP publishes the HDI index at the end of each year. Figure 2 illustrates this procedure. This reduces the uncertainty and inconsistency in the way the Internet penetration is calculated and is a more scientific method. Countries which want to overcome this limitation will be encouraged to conduct demand-side surveys.

4

Start

Yes

Demand side surveys available for relevant year?

Use demand side survey data

No

Demand side survey conducted previously

No

Estimate Internet penetration rate based on new methodology

End Yes End Use Internet penetration growth rate of similar country in region with similar Income level which has conducted more than one survey to estimate current Internet penetration rate

End

No

More than one survey has been conducted?

Yes Calculate annual Internet penetration growth for country and estimate current Internet penetration rate based on previous survey data

End

Figure 2 – Flow chart of steps for proposed process for arriving at more reasonable estimates for Internet Users. Source: Authors 5

Testing the new indicator estimation method in Sri Lanka The proposed methodology can be demonstrated using Sri Lanka as a test case. As at 2010 the Internet penetration rate of Sri Lanka was 12% (ITU database). The Telecommunication Regulatory Commission Sri Lanka (TRCSL) website gives 280,000 as the number of “Internet and email subscribers - fixed” as at Dec 2010, a rather mysterious and undefined indicator that does not seem to have international comparators. Even though ITU data does not state that Sri Lanka has had a demand side survey, in 2009 the Department of Census and Statistics, Sri Lanka conducted a Household ICT literacy survey. According to this survey Sri Lanka had “an Internet using household population" (aged 5 – 69) of 13.1% in 2009(Department of Census and Statistics Sri Lanka, 2009). Since a survey has already been conducted, according to the proposed method of estimation, it is not necessary to use the formula. We should simply increase the 13.1% penetration rate as at the last demand side survey in 2009 by the corresponding growth rate to estimate the figure for 2010. According to the 2004 Computer Literacy Survey conducted by the Department of Census and Statistics Only 3% individuals were able to use the Internet (Satharasinghe, 2004). Therefore Internet penetration rate has increased by 2% annually. Extending this value to 2010 gives us an Internet penetration of 15.1% according to the new method. Though ITU recommends use of demand side survey data, they do not seem to be taking the data of the 2009 ICT literacy survey into account, nor do they seem to be using the TRCSL data, so where ITU obtains a penetration rate of only 12% is unclear. Considering that demand side surveys are the best method of estimation of Internet penetration rate, it is important that this figure is used in national and international reporting.

CONCLUSION This paper proposes an evidence based improvement to the method of measuring Internet penetration rate, which uses the existing, but incomplete demand-side data and modelling. There are several shortcomings in the current method of estimating the number of Internet users in countries where demand-side data is unavailable (Beilock et al 2003, Donner et al 2009, ITU 2010b). In the absence of demand-side surveys, governments calculate number of Internet users based on the number of subscriptions, using a multiplier (ITU 2009). There are no guidelines or methodologies to estimate this multiplier and is left to the discretion of the country administrations. This could lead to the question of whether unreasonably high multipliers being used by administration to show high growth rates. In addition there are many difficulties in estimating total subscriptions in a country due to mobile data sometimes not being reported, other times being reported as number of potential subscriptions. This paper explores the possibility of using readily available income and education data (components of the Human Development Index (HDI)) to derive a more accurate and consistent estimate of the proportion of individuals using the Internet in the absence of demand side surveys. The proposed method creates strong incentives for countries to conduct demand-side surveys because that is the only way they can escape the constraint of the mathematically derived estimate. If a national administration believes that it actually has implemented policies that have resulted in a higher number of Internet users than the model predicts, all it has to do is to conduct a demand-side survey to demonstrate its success. Studying the various estimations for the Internet penetration rate of Sri Lanka, it is important that the demand side survey results are communicated both nationally and internationally, especially as it gives a higher penetration rate than the one proposed by ITU.

6

REFERENCES Beilock, R. and Dimitrova D. V. (2003), An exploratory model of inter-country Internet diffusion, retrieved from http://danielad.jlmc.iastate.edu/docs/intercountry.pdf Calderaro, A. (2009). “The Digital Divide, framing and mapping the phenomenon”. In Ferro E., Dwivedi Y.K., Gil-Garcia J.R., Williams M.D. (Eds.), Handbook of Research on Overcoming Digital Divides. Hershey: IGI Global Chaudhuri, A., Flamm, K. S., & Horrigan, J. (2005). An analysis of the determinants of internet access. Telecommunications Policy, 29(9-10), 731-755. doi:10.1016/j.telpol.2005.07.001 Chinn, M. D., Fairlie R. W. (2010), ICT Use in the Developing World: An Analysis of Differences in Computer and Internet, Review of International Economics, 18(1), 153–167, http://www.google.lk/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&ved=0CCEQFjAA&url=http %3A%2F%2Fwww.ssc.wisc.edu%2F~mchinn%2Fchinn_fairlie_RIE2010.pdf&ei=BnSiT9KlC4nqrA fcv9m2Bw&usg=AFQjCNHvmg1_MDlA3VbK1iuMlIVL9JWJHw CKS Consulting Pvt. Ltd (2009),Teleuse@BOP3 a Qualitative Study, Colombo: LIRNEasia Department of Census and Statistics Sri Lanka, 2009, Computer Literacy Survey - 2009 retreived from http://www.statistics.gov.lk/CLS/BuletinComputerLiteracy_2009.pdf Donat E., Brandtweiner R., Kerschbaum A. (2009) Attitudes and the Digital Divide: Attitude Measurement as Instrument to predict Internet Usage, in International Journal of an Emerging Transdiscipline, Volume 12 Donner, J., & Toyama, K. (2009), Persistent themes in ICT4D Research: priorities for intermethodological exchange, retrieved from http://www.jonathandonner.com/DonnerToyama_ISI2009prepub.pdf Dutta, S., Bilbao-Osorio B. 2012, The Global Information Technology Report 2012:Living in a Hyperconnected, Geneva: World Economic Forum Hilbert, M., (2011) The end justifies the definition: The manifold outlooks on the digital divide and their practical usefulness for policy-making, Telecommunication Policy September 2011 35: pp 715736 Hilbert, M., & Peres, W. (Eds.). (2010). Information societies in Latin America and the Caribbean: Development of technologies and technologies for development. ECLAC Books. Santiago: United Nations Economic Commission for Latin America and the Caribbean. Retrieved from http://www.cepal.org/socinfo/publicaciones/default.asp?idioma=IN Gillwald, Alison and Stork, Christoph, 2008, Towards Evidence based ICT Policy and Regulation: ICT access and usage in Africa – Volume One 2008 paper two, retrieved from http://www.researchictafrica.net/publications/Towards_Evidencebased_ICT_Policy_and_Regulation__Volume_1/RIA%20Policy%20Paper%20Vol%201%20Paper%202%20%20ICT%20Access%20and%20Usage%20in%20Africa%202008.pdf ITU (1984), The Missing Link: Report of the Commission for World-Wide Telecommunications Development, Geneva: ITU ITU (2010a) Definitions of World Telecommunication/ICT Indicators March 2010, Geneva: ITU. Access from http://www.itu.int/ITUD/ict/material/TelecomICT%20Indicators%20Definition_March2010_for%20web.pdf . ITU (2011a) Handbook for the collection of administrative data on telecommunications/ICT, Geneva: ITU 7

ITU (2011b) Measuring the Information Society, Geneva: ITU ITU (2011c) Measuring the WSIS Targets, a statistical framework, Geneva: ITU Jipp, A. (1963). Wealth of nations and telephone density. Telecommunications Journal, July 1963, pp. 199-201. LIRNEasia (2010). Teleuse at the bottom of the pyramid data booklet: Findings from a six country study. p.3. Available at: http://lirneasia.net/wp-content/uploads/2008/04/Teleuse-at-BOP-DataBooklet-LIRNEasia.pdf Partnership on Measuring ICT for Development (2009), Revisions and additions to Core ICT Indicators 2010, Geneva: ITU Partnership on Measuring ICT for Development (2010), Core ICT Indicators 2010, Geneva: ITU Samarajiva, R (2009). How the developing world may participate in the global Internet economy: Innovation driven by competition, in ICTs for development: Improving policy coherence, pp. 75-118. Paris: OECD. http://www.oecd.org/dataoecd/39/15/44003919.pdf Samarajiva, R. & Lucas R. (2010), Improving measurement of progress toward Target 10 of the World Summit on Information Society (WSIS), retrieved from http://lirneasia.net/wpcontent/uploads/2010/10/WSIS_29Sep10.pdf Satharasinghe, A., (2004), Computer Literacy of Sri Lanka – 2004, Retrieved from http://www.statistics.gov.lk/CLS/ Telecommunication Regulatory Commission Sri Lanka (TRCSL) website, 2010, Statistics, Retrieved from http://www.trc.gov.lk/information/statistics.html The Economist Intelligence Unit, 2010, Digital economy rankings 2010:Beyond e-readiness, retrieved from, http://graphics.eiu.com/upload/EIU_Digital_economy_rankings_2010_FINAL_WEB.pdf Treasure, J.A.P. (1953). Retail audit research. The Incorporated Statistician. 4(3): 148-59 http://www.jstor.org/pss/2988333. United Nations (2010) Millennium Development Goals report, New York: United Nations Department of Economic and Social Affairs United Nations Conference on Trade and Development (UNCTAD) (2011), Information Economy Report 2011, ICTs as an Enabler for Private Sector Development, Geneva: UNCTAD United Nations Development Programme (2011) Human Development Report 2011, Sustainability and Equity: A Better Future for All, New York: UNDP Win Tun, Haymar (2010) Building Knowledge Economies, Application of Knowledge Assessment Methodology (KAM). Policy analysis exercise. Lee Kuan Yew School of Public Policy, National University of Singapore. World Bank (2010b), Gross national income per capita 2009, Atlas method and PPP, http://siteresources.worldbank.org/DATASTATISTICS/Resources/GNIPC.pdf Wortham, J. (2010, May 13). Cellphones now used more for data than for calls. New York Times. http://www.nytimes.com/2010/05/14/technology/personaltech/14talk.html

8