Collection ID: ua940

Collection context

Summary

Creator:
State University of New York at Albany. Office of Information Technology Services
Abstract:
This collection contains daily and monthly captures of the top-level domain for the University website, www.albany.edu, as well as weekly captures of the University NewsCenter website, www.albany.edu/news. Webcrawling is managed through the Internet Archive's Archive-It service.
Extent:
Approximately 720 GB and 2513 captures
Language:
English.

Background

Scope and Content:

This collection contains archived versions of the University website, dating back to 1997. Currently, the website is one of the main methods of delivery for news and information about the University. It contains the webpages of academic departments, administrative offices, and student groups. Content includes course and curricular information, University news updates, University publications, photographs, admissions information, and other information relating to the day-to-day functions of the University. Documents in the web archives include a variety of formats ranging from simple HTML documents, PDFs, and images, to more complex web documents containing embedded video and audio content, forms, and other common features.

The completeness of each website snapshot varies. The daily crawls of the University website collect a limited number of documents, whereas the monthly crawls collect as many documents as possible in a 5-day period. As a result, the amount of information available from a given day may be limited, and some web pages may not be represented as comprehensively as others.

Biographical / Historical:

The website of the University at Albany was launched in the mid-1990s.

As of 2017, the Office of Information Technology Services partners with the Office of Communications and Marketing to provide Open Text (formerly RedDot), the content management system used to maintain University websites. ITS offers consulting on using the Open Text software, while Communications and Marketing provides support for content and design.

Acquisition information:

Web Archives are created through regular crawls performed by Archive-It, a service provided by the Internet Archive.

Archive-It performs regular crawls to create as complete a snapshot of the University's website as is feasible. In a given webcrawl, Archive-It begins from a seed (a high-level domain, such as www.albany.edu) or set of seeds, and then automatically harvests successive layers of the website by following links from this seed. This process continues for a specified duration or until a specified number of documents has been harvested.
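
The seed-and-follow process described above can be illustrated with a minimal sketch. It is not Archive-It's actual crawler: the function name `crawl`, the `max_documents` parameter, and the in-memory `SITE` link graph are all hypothetical, and the duration limit is omitted for simplicity. The sketch only shows how harvesting layer by layer from a seed stops once a document cap is reached.

```python
from collections import deque


def crawl(seeds, links, max_documents):
    """Harvest pages breadth-first from the seeds, layer by layer.

    `links` maps each page to the pages it links to (a stand-in for
    fetching and parsing live pages). Harvesting stops when no pages
    remain or when `max_documents` pages have been collected.
    """
    harvested = []
    seen = set(seeds)      # pages already queued, to avoid repeats
    queue = deque(seeds)   # pages waiting to be harvested
    while queue and len(harvested) < max_documents:
        page = queue.popleft()
        harvested.append(page)
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return harvested


# Hypothetical link graph; these paths are illustrative only.
SITE = {
    "www.albany.edu": ["www.albany.edu/news", "www.albany.edu/admissions"],
    "www.albany.edu/news": ["www.albany.edu/news/story1"],
    "www.albany.edu/admissions": ["www.albany.edu/admissions/apply"],
}

limited = crawl(["www.albany.edu"], SITE, max_documents=3)
full = crawl(["www.albany.edu"], SITE, max_documents=1000)
```

With the cap set to 3, only the seed and the first layer of links are harvested; with a cap larger than the site, every reachable page is collected. This mirrors why a capped daily crawl is less complete than a longer monthly crawl.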

Series 1, the main UAlbany website, is crawled on a daily basis, with more comprehensive crawls performed monthly. The daily crawls are limited to approximately 1,000 documents, whereas the monthly crawls harvest as much as possible in a 5-day period.

Series 2, the UAlbany NewsCenter website, is crawled once per week. Each crawl lasts no longer than 3 days.

Additional information on Archive-It can be found here.

Additional information on the Internet Archive can be found here.

Arrangement:

This collection consists of two series:

Series 1 contains daily captures of the top-level www.albany.edu domain and monthly captures of a longer list of domains.

Series 2 contains captures of the UAlbany NewsCenter website located at www.albany.edu/news.

Website captures are arranged chronologically.

Physical / technical requirements:

The University's Web Archives are hosted by the Internet Archive and can be viewed on the Wayback Machine. Viewing archived webpages requires a browser with JavaScript turned on. If JavaScript is turned off, images and links on web pages will be from the live web, not from archived web files.
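
As a rough illustration of how a capture is addressed on the Wayback Machine, the sketch below builds a capture URL from an original address and a date. It assumes the public Wayback Machine pattern of a 14-digit timestamp followed by the original URL; the captures in this collection may be served from a different, collection-specific address, and the function name `wayback_url` and the example date are hypothetical.

```python
from datetime import datetime

# Assumption: the public Wayback Machine prefix. This collection's
# captures may be served from a different host.
WAYBACK_PREFIX = "https://web.archive.org/web"


def wayback_url(original_url, captured_at):
    """Build a capture URL: prefix, 14-digit timestamp, original URL."""
    timestamp = captured_at.strftime("%Y%m%d%H%M%S")
    return f"{WAYBACK_PREFIX}/{timestamp}/{original_url}"


# Illustrative date only; it does not refer to a known capture.
example = wayback_url("http://www.albany.edu/", datetime(2017, 1, 15))
```

When the requested timestamp does not match a capture exactly, the Wayback Machine generally redirects to the nearest available capture.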

Online content

Access

RESTRICTIONS:

Access to this collection is unrestricted.

TERMS OF ACCESS:

This page contains links to digital objects. Access to these images and the technical capacity to download them does not imply permission for re-use. Digital objects may be used freely for personal reference use, referred to, or linked to from other web sites.

Researchers do not have permission to publish or disseminate material from these collections without permission from an archivist and/or the copyright holder.

The researcher assumes full responsibility for conforming to the laws of copyright. Some materials in these collections may be protected by the U.S. Copyright Law (Title 17, U.S.C.) and/or by the copyright or neighboring-rights laws of other nations. More information about U.S. Copyright is provided by the Copyright Office. Additionally, re-use may be restricted by terms of University Libraries gift or purchase agreements, donor restrictions, privacy and publicity rights, licensing and trademarks.

The University Archives are eager to hear from any copyright owners who are not properly identified so that appropriate information may be provided in the future.

LOCATION OF THIS COLLECTION:
M. E. Grenander Department of Special Collections & Archives
Science Library 350
1400 Washington Ave
Albany, NY 12222, United States
CONTACT:
518-437-3933