Overview

Dataset statistics

Number of variables: 5
Number of observations: 189
Missing cells: 188
Missing cells (%): 19.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 47.9 KiB
Average record size in memory: 259.5 B
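
The derived figures in this block can be cross-checked from the raw counts. The sketch below assumes that the missing-cell percentage is taken over all cells (variables times observations) and that the average record size is the total memory divided by the number of observations. The report does not state either formula, but both assumptions reproduce its numbers.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 5
n_observations = 189
missing_cells = 188
total_size_kib = 47.9

# Assumption: percentages are taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size = total memory / number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Total cells: {total_cells}")
print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

All 188 missing cells sit in the notes column, which is why a single sparse column accounts for nearly a fifth of the table.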

Variable types

CAT: 3
URL: 1
DATE: 1

Reproduction

Analysis started: 2020-02-14 00:06:14.196809
Analysis finished: 2020-02-14 00:06:15.577118
Version: pandas-profiling v2.5.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml
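
The two timestamps give the wall-clock duration of the profiling run. A minimal sketch, assuming both timestamps are in the same time zone and use the format shown above:

```python
from datetime import datetime

# Timestamp layout as printed in the "Reproduction" block.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2020-02-14 00:06:14.196809", FMT)
finished = datetime.strptime("2020-02-14 00:06:15.577118", FMT)

elapsed = finished - started
print(f"Profiling took {elapsed.total_seconds():.3f} s")
```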

Warnings

notes has 188 (99.5%) missing values [Missing]

Variables

url
URL

UNIQUE
Distinct count: 189
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Full URL

Value                                                     Count  Frequency (%)
https://www.privacyinternational.org/                         1           0.5%
http://web.amnesty.org/                                       1           0.5%
http://yekolotemari.blog.com/                                 1           0.5%
http://www.tegbar.org/                                        1           0.5%
http://www.ultrasurf.us/                                      1           0.5%
http://www.uneca.org/index.htm                                1           0.5%
http://www.savenega.org/Welcome.html                          1           0.5%
http://www.tzta.ca/                                           1           0.5%
http://stream.aljazeera.com/story/201306250132-0022854        1           0.5%
https://www.twitter.com/                                      1           0.5%
Other values (179)                                          179          94.7%
 
Scheme

Value  Count  Frequency (%)
http     173          91.5%
https     16           8.5%
 
Netloc

Value                    Count  Frequency (%)
nazret.com                   8           4.2%
www.cafpde.org               3           1.6%
www.hrw.org                  3           1.6%
www.ethiopiafirst.com        2           1.1%
www.andenet.com              2           1.1%
www.bds-ethiopia.net         2           1.1%
www.unido.org                2           1.1%
www.savenega.org             2           1.1%
www.populationmedia.org      2           1.1%
www.cyberethiopia.com        2           1.1%
Other values (134)         161          85.2%
 
Path

Value                                                                Count  Frequency (%)
/                                                                      127          67.2%
/blog/index.php                                                          7           3.7%
/index.html                                                              2           1.1%
/index.htm                                                               2           1.1%
/Herald/articlefront.asp                                                 1           0.5%
/s/n65b3d67f82asn2/Leaked%20National%20Entrance%20Exam_English.pdf       1           0.5%
/cafpde/                                                                 1           0.5%
/xhtml                                                                   1           0.5%
/~ena/                                                                   1           0.5%
/regions_06/africa_06/africa_06.html                                     1           0.5%
Other values (45)                                                       45          23.8%
 
Query

Value                                          Count  Frequency (%)
(empty)                                          174          92.1%
Itemid=52&id=18&option=com_content&task=view       1           0.5%
feed=5&how=paged&what=all                          1           0.5%
blog=9                                             1           0.5%
blog=14                                            1           0.5%
iso=eth                                            1           0.5%
blog=12                                            1           0.5%
country=231&region=2&section=9&sub_section=2       1           0.5%
cc=ETH                                             1           0.5%
id_rubrique=20                                     1           0.5%
Other values (6)                                   6           3.2%
 
Fragment

Value     Count  Frequency (%)
(empty)     188          99.5%
ethiopia      1           0.5%
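
The per-component tables for the url variable correspond to the usual decomposition of a URL into scheme, network location, path, query and fragment. The report does not say how it splits URLs, so the sketch below is an assumption: it uses the standard-library `urllib.parse.urlsplit` on four URLs taken from this report's sample rows and tallies each component.

```python
from collections import Counter
from urllib.parse import urlsplit

# A few URLs copied from the sample rows of this report.
urls = [
    "http://aljazeera.net/",
    "https://www.hrw.org/",
    "https://www.twitter.com/",
    "https://www.dropbox.com/s/n65b3d67f82asn2/"
    "Leaked%20National%20Entrance%20Exam_English.pdf?dl=0",
]

# urlsplit returns a named tuple: scheme, netloc, path, query, fragment.
parts = [urlsplit(u) for u in urls]

component_counts = {
    field: Counter(getattr(p, field) for p in parts)
    for field in ("scheme", "netloc", "path", "query", "fragment")
}

for field, counts in component_counts.items():
    print(field, counts.most_common())
```

A URL without a query or fragment yields an empty string for that component, which matches the unlabeled top row in the last two tables.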
 

category_code
Categorical

Distinct count: 15
Unique (%): 7.9%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value             Count  Frequency (%)
NEWS                 65          34.4%
HUMR                 45          23.8%
POLR                 32          16.9%
ECON                 13           6.9%
ANON                  8           4.2%
CULTR                 7           3.7%
XED                   5           2.6%
HOST                  3           1.6%
MISC                  3           1.6%
PUBH                  2           1.1%
Other values (5)      6           3.2%
 

Length

Max length: 5
Mean length: 4
Min length: 3

Unicode categories

Value             Count  Frequency (%)
Uppercase_Letter     21         100.0%

Unicode scripts

Value  Count  Frequency (%)
Latin     21         100.0%

Unicode blocks

Value  Count  Frequency (%)
ASCII     21         100.0%
 

date_added
Date

Distinct count: 6
Unique (%): 3.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB
Minimum: 2014-04-15 00:00:00
Maximum: 2018-04-10 00:00:00
[Histogram]

source
Categorical

Distinct count: 5
Unique (%): 2.6%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Value            Count  Frequency (%)
citizenlab         178          94.2%
OONI                 4           2.1%
CIPIT                4           2.1%
BBC                  2           1.1%
defenddefenders      1           0.5%
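
The frequency column is each count divided by the number of observations. A short sketch that recomputes it from the counts listed for source, assuming percentages are rounded to one decimal place:

```python
# Counts copied from the source frequency table in this report.
source_counts = {
    "citizenlab": 178,
    "OONI": 4,
    "CIPIT": 4,
    "BBC": 2,
    "defenddefenders": 1,
}

total = sum(source_counts.values())

# Assumption: the report rounds each percentage to one decimal place.
frequency_pct = {
    value: round(100 * count / total, 1)
    for value, count in source_counts.items()
}

for value, pct in frequency_pct.items():
    print(f"{value:<16} {source_counts[value]:>4} {pct:>5.1f}%")
```

Because each percentage is rounded independently, the column does not have to sum to exactly 100.0.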
 

Length

Max length: 15
Mean length: 9.71957672
Min length: 3

Unicode categories

Value             Count  Frequency (%)
Lowercase_Letter     13          65.0%
Uppercase_Letter      7          35.0%

Unicode scripts

Value  Count  Frequency (%)
Latin     20         100.0%

Unicode blocks

Value  Count  Frequency (%)
ASCII     20         100.0%
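
The length and character figures for source can be reproduced from its five values and their counts. The sketch below rests on two assumptions the report does not state: that mean length is weighted by how often each value occurs, and that the character tables count distinct characters rather than occurrences. The `CATEGORY_NAMES` mapping is a hypothetical helper that spells out the two-letter codes returned by `unicodedata.category`.

```python
import unicodedata
from collections import Counter

# Values and counts copied from the source frequency table.
source_counts = {
    "citizenlab": 178,
    "OONI": 4,
    "CIPIT": 4,
    "BBC": 2,
    "defenddefenders": 1,
}
total = sum(source_counts.values())

# Length statistics, with the mean weighted by value frequency.
max_length = max(len(v) for v in source_counts)
min_length = min(len(v) for v in source_counts)
mean_length = sum(len(v) * n for v, n in source_counts.items()) / total

# Hypothetical mapping from Unicode category codes to long names.
CATEGORY_NAMES = {"Ll": "Lowercase_Letter", "Lu": "Uppercase_Letter"}

# Assumption: each distinct character is counted once.
distinct_chars = set("".join(source_counts))
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for ch in distinct_chars
)

print(max_length, min_length, round(mean_length, 8))
print(dict(category_counts))
```

Under these assumptions the result agrees with the tables: 13 distinct lowercase letters and 7 distinct uppercase letters, 20 characters in total.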
 

notes
Categorical

MISSING
Distinct count: 1
Unique (%): 100.0%
Missing: 188
Missing (%): 99.5%
Memory size: 1.6 KiB

Value               Count  Frequency (%)
Reportedly blocked      1           0.5%
(Missing)             188          99.5%
 

Length

Max length: 18
Mean length: 3.079365079
Min length: 3

Unicode categories

Value             Count  Frequency (%)
Lowercase_Letter     13          86.7%
Uppercase_Letter      1           6.7%
Space_Separator       1           6.7%

Unicode scripts

Value   Count  Frequency (%)
Latin      14          93.3%
Common      1           6.7%

Unicode blocks

Value  Count  Frequency (%)
ASCII     15         100.0%
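
The length and character figures for notes look odd for a column with a single 18-character value. One reading that reproduces them, offered as an assumption rather than something the report confirms, is that the 188 missing entries were converted to the string "nan" before the text statistics were computed. The sketch below checks that reading against the reported numbers. `CATEGORY_NAMES` is a hypothetical helper mapping.

```python
import unicodedata
from collections import Counter

# One real value plus 188 missing entries, as listed for notes.
# Assumption: missing entries are stringified as "nan".
values = ["Reportedly blocked"] + [str(float("nan"))] * 188

lengths = [len(v) for v in values]
max_length = max(lengths)
min_length = min(lengths)
mean_length = sum(lengths) / len(lengths)

# Hypothetical mapping from Unicode category codes to long names.
CATEGORY_NAMES = {
    "Ll": "Lowercase_Letter",
    "Lu": "Uppercase_Letter",
    "Zs": "Space_Separator",
}

# Assumption: each distinct character is counted once.
distinct_chars = set("".join(values))
category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for ch in distinct_chars
)

print(max_length, min_length, round(mean_length, 9))
print(dict(category_counts))
```

If this reading is right, the minimum and mean lengths and two of the 13 lowercase letters ("n" and "a") describe the placeholder string rather than the data, so only the maximum length of 18 reflects the one real note.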
 

Correlations

Missing values

Sample

First rows

     url                                                                          category_code  date_added  source      notes
0    http://abrahadesta.wordpress.com/                                            CULTR          2014-04-15  citizenlab  NaN
1    http://aljazeera.net/                                                        NEWS           2014-04-15  citizenlab  NaN
2    http://am.wikipedia.org/                                                     MISC           2014-04-15  citizenlab  NaN
3    http://am.wikipedia.org/wiki/%E1%8B%8B%E1%8A%93%E1%8B%8D_%E1%8C%88%E1%8C%BD  MISC           2014-04-15  citizenlab  NaN
4    http://amharic.voanews.com/                                                  NEWS           2014-04-15  citizenlab  NaN
5    http://ancientgebts.org/                                                     HUMR           2014-04-15  citizenlab  NaN
6    http://carpediemethiopia.blogspot.com/                                       POLR           2014-04-15  citizenlab  NaN
7    http://citizenlab.org/                                                       NEWS           2014-04-15  citizenlab  NaN
8    http://cpj.org/                                                              NEWS           2014-04-15  citizenlab  NaN
9    http://egoportal.blogspot.com/                                               POLR           2014-04-15  citizenlab  NaN

Last rows

     url                                                                                             category_code  date_added  source      notes
179  https://www.citizenlab.org/                                                                     NEWS           2014-04-15  citizenlab  NaN
180  https://www.dropbox.com/s/n65b3d67f82asn2/Leaked%20National%20Entrance%20Exam_English.pdf?dl=0  FILE           2016-05-30  OONI        NaN
181  https://www.facebook.com/Jawarmd                                                                NEWS           2016-05-30  OONI        NaN
182  https://www.facebook.com/pages/Addis-Neger/49967100821                                          NEWS           2014-04-15  citizenlab  NaN
183  https://www.hrw.org/                                                                            HUMR           2014-04-15  citizenlab  NaN
184  https://www.mereja.com/                                                                         NEWS           2016-09-09  CIPIT       NaN
185  https://www.oromiamedia.org/                                                                    NEWS           2016-05-30  OONI        NaN
186  https://www.privacyinternational.org/                                                           HUMR           2014-04-15  citizenlab  NaN
187  https://www.torproject.org/                                                                     NEWS           2014-04-15  citizenlab  NaN
188  https://www.twitter.com/                                                                        HOST           2014-04-15  citizenlab  NaN