This is an essay. It contains the advice or opinions of one or more Wikipedia contributors. This page is not an encyclopedia article, nor is it one of Wikipedia's policies or guidelines, as it has not been thoroughly vetted by the community. Some essays represent widespread norms; others only represent minority viewpoints. |
While the official count is somewhere around 750,000, having looked through quite a few of these articles, I've noticed that there are a number of these that would be hard to call "articles". Some are a sentence long, some are disambiguation pages, some are more of a chart or a list than an article (not to say they aren't necessarily useful, but the term "article" might be a bit of a stretch). So I decided to explore what these 750,000 articles contain. To aid me in this I have used the random article option. Now, it's been a while since I've taken a class in probability and the like, but it is my understanding that, if truly random, a pretty small sample can yield pretty accurate results. This is assuming the random article button is truly random (and I think it is). I have cateogrized the results into several categories:
Note this is more about types of articles than article quality. In my notes I originally had "full artciles" down as "good articles", but decided to change it. Since I won't be reading anything but the stubs and substubs in their entirety, I cannot judge the accuracy or quality of them (even if I read them completely, I still couldn't do so without substantial research, which would obviously slow this project down immensely). There are also articles, while full and complete, I wouldn't necessarily call "good". I didn't want to have to make a separate category for fancruft or anything, as that would obviously be severely subjective. At some later point, I think I may undertake another such project in which I categorize random full articles into subject, paying particular attention to the amount of fiction. One criticism (not entirely unfounded) that Wikipedia often garners is its level of detail in TV shows, sci-fi, anime, etc., potentially at the expense of other subjects (though whether this is really at any expense is clearly debatable).
The number of random articles I explored was 500. This should give me a pretty accurate reading. I'll have to look into calculating the margin of error (if any math folks who want to help I'd be grateful).
If anyone knows of a similar project by another Wikipedian, I'd be very curious to see it. Any feedback on this I'd love to hear. Leave it on my talk page.