Posted on April 28th, 2010 at 11:05 PM by Wan
Listen with webreader

Internet Archive is an Internet non-profit library that offers permanent access for researchers, historians and scholars, people with disabilities and the general public to historical collections that exist in digital format.  Services offered includes texts, audio, moving images, software as well as archived web pages.  Its largest collection is its web archive which is mirrored at the Bibliotheca Alexandrina in Egypt. It allows user to both upload and download digital material and provides unrestricted online access for FREE.

Marxists Internet Archive (MIA) is another similar comparison to Internet Archive. It is also a non-profit organisation that provides multi-lingual, more than 40 languages.  Unfortunately, it faced a problem of Denial-of-service attacks from China in January 2007.  The seriousness of attacks coupled with other hosting issues, led to its closure for several weeks, thus making users lose faith using their services.

Internet Archive harnessed the long tail and leverage customer self-service by referring to the vast number of small sites that made up of:

  • Wayback Machine, a digital time capsule service that allows users to see archived version of web pages across time.
  • Prelinger Archive, a collection of films.
  • Open Library, a comprehensive online database of books that is still in beta.
  • Live Music Archive, a collection of audio.

They make everything available by embracing the niches. The Open Library has more than 25 millions of books online that oversees one of the world’s largest book digitisation projects whereas, Live Music Archive has over 50,000 audio from independent artists.  Users could buy from Amazon, Abebooks, Alibris, Barnes and Noble, Powells or borrow from the Worldcat library, BookMooch, Title Trader or search through Google Book. This “live web” uses RSS aggregator technology that allows user to link not just to a page but also subscribe to it with notification every time the page changes.

It harness collective intelligence through positive feedback loop and more algorithm-fueled recommendations, where it guides users by following their likes and dislikes, therefore easing their exploration of the unknown to find their requirements using “More like this” statement towards the chosen subjects items.  It leverages the “wisdom of crowd” on the open-source media collections that are contributed implicitly by the public. Recently, the Archive has begun working to provide specialized services relating to the information access needs of the print-disabled. TechRepublic podcast stated that one of best practices in web 2.0 application is ability to provide usability for the disable communities. Hence, it provides specialized services for adaptive reading and information access for the blind and person with disabilities.

The future looks promising for the Internet Archive as it offers more books, free music concerts, TV programs and a “snapshot” of World Wide Web each day capturing many implicit contributions from a large number of users.



An Internet Archive Photo Gallery (Require Flash)



The Internet Archive, In Depth

REFERENCES:

Anderson, C. (2004). The Long Tail. Retrieved April 28, 2010 from Electronic Book Library database.

Internet Archive. (2001). Retrieved April 28, 2010.

Marxists Internet Archive. (n.d). Retrieved April 28, 2010.

O’Reilly Media. (2010).What is Web 2.0: Web 2.0 Design Patterns. Retrieved April 27, 2010.

15 Responses to “ LEVERAGING THE LONG TAIL USING INTERNET ARCHIVE ”

  • HAO LONG says:

    hey again,
    GOOD ONE! i think in future we just use webiste to work and study , hows that good. and how the web2.0 applications works, we can search on the internet and watch movie or something, it quite different from others.

    [WORDPRESS HASHCASH] The poster sent us ’0 which is not a hashcash value.

  • Kelly says:

    Hi Wan, great blog topic and nice example of leveraging the long tail. I think of the interesting point is Internet Archive aiming to make online digital library to be universal access to all knowledge. To allow them to achieve that, they provided audio function which means the disable people will be able to access information as well. This is different approach to other similiar digital library.

  • Hajah says:

    Can u explain more how internet Archive spreads its services?

    • Wan says:

      Thanks Hajah. I believe Internet Archive spreads its services in many ways. One way is by having other libraries and other institutions uses their services globally worldwide. Hence, by users experiencing this site, it could even spreads further with their word of mouth. This is far by more effective.

  • Hajah says:

    Hi, Very interseting blog. i’ve got usefull information.

  • Hi Wan ..
    Really interested topics and great sample to explain the concept of Leveraging Long Tail bushiness.

    I realized hat Internet Archive is distrubilte its contents online. It also support the disable people accessibility and multilingual. I believe these three factors is enough to make popular over the whole word.

    I will add more information Internet Archive which I found in their website. Its main offices in San Francisco house about 30 employees. The Archive has an annual budget of $10 million, derived from a variety of sources: revenue from its Web crawling services, various partnerships, grants, donations, and the Kahle-Austin Foundation.

    [WORDPRESS HASHCASH] The poster sent us ’0 which is not a hashcash value.

    • Wan says:

      Yes Hashim, it’s interesting how Internet Archive spreads its services all over the world by decentralizing its servers over several continents. So in case if it got hack, the services won’t be disrupted unlike MIA’s experiences

  • Hi Wan ..
    Really interested topics and great sample to explain the concept of Leveraging Long Tail bushiness.

    I realized hat Internet Archive is distrubilte its contents online. It also support the disable people accessibility and multilingual. I believe these three factors is enough to make popular over the whole word.

    I will add more information Internet Archive which I found in their website. Its main offices in San Francisco house about 30 employees. The Archive has an annual budget of $10 million, derived from a variety of sources: revenue from its Web crawling services, various partnerships, grants, donations, and the Kahle-Austin Foundation.

    [WORDPRESS HASHCASH] The poster sent us ’0 which is not a hashcash value.

  • Amiel says:

    Nice blog! You’ve even manage to add an automatic reader; I find that very convenient. It’s great how a lot of online organizations are leveraging the long tail to cater for a specific niche. MIA, as you’ve mentioned is a good example because it takes into account a whole range of different languages. It basically caters for a whole range of people with different languages. So I can imagine the amount of visitor it would get world wide, surely it’s in millions. In addition, I feel a great sense of gratefulness on how the organization is considering the disabled people.

    [WORDPRESS HASHCASH] The poster sent us ’0 which is not a hashcash value.

    • Wan says:

      Yes, amiel you are absolutely right about that. Once we had tackled those issues, we could later harness their collective intelligence from the wisdom of crowd.

  • Nick says:

    I have just bookmarked archive.org. It is a great source of digital material. There is a HUGH collection of media and resources and it does used many of the best practises, including “the long tail”. But because of the abundance of material that is sometimes quite unique (not pop culture), I would imagine that it would be hard to find what you’re looking for. Collective intelligence is a good practise however, as so much content is unique and targeting niche audience it might be hard to judge content solely on user ratings.
    Great blog. I look forward to exploring Archive.org.

    [WORDPRESS HASHCASH] The poster sent us ’0 which is not a hashcash value.

  • Jack Marrows says:

    Great blog Wan. Full of interesting media and you have introduced an application I haven’t seen before. Its a great example for many reasons from allowing users to submit their own content (making the tail longer) to how the content is searched. How does the site prevent inappropriate content from being submitted? I imagine this could be a concern to them.

    [WORDPRESS HASHCASH] The poster sent us ’0 which is not a hashcash value.

    • Wan says:

      Yes you are right Jack, submitting junk content would be a main concern, let alone other obscene materials. I remember few days ago i did try to send some video materials i did it myself, and i got this auto reply message from a moderator telling me it will take less than 2 days to acknowledge the submission. I would assume this would be one of their method of controlling the wisdom of crowd through editorial review. Also, i noticed that they used Recaptcha to prevent spam.

  • Jack Marrows says:

    Great blog Wan. Full of interesting media and you have introduced an application I haven’t seen before. Its a great example for many reasons from allowing users to submit their own content (making the tail longer) to how the content is searched. How does the site prevent inappropriate content from being submitted? I imagine this could be a concern to them.

    [WORDPRESS HASHCASH] The poster sent us ’0 which is not a hashcash value.

  • Jack Marrows says:

    Great blog Wan. Full of interesting media and you have introduced an application I haven’t seen before. Its a great example for many reasons from allowing users to submit their own content (making the tail longer) to how the content is searched. How does the site prevent inappropriate content from being submitted? I imagine this could be a concern to them.

    [WORDPRESS HASHCASH] The poster sent us ’0 which is not a hashcash value.

Leave a Reply

You must be logged in to post a comment.