./hw9 http://www.thesite.com/thepathyour program will do the following:
./hw9 http://amita.cs.uic.edu Visited pages: 1 Links checked: 11 Broken links ( 1): http://amita.cs.uic.edu/www.openwrt.org linked from: http://amita.cs.uic.edu ./hw9 http://www.cs.uic.edu/~llyons/ Visited pages: 9 Links checked: 22 Broken links (1): http://www.cs.uic.edu/~llyons/www.mcmaster.com linked from: http://www.cs.uic.edu/~llyons/ ./hw9 http://rites.uic.edu Visited pages: 15 Links checked: 94 Broken links (4): http://www.ssn.uillinois.edu/html/ssn_forms_docs.html linked from: http://rites.uic.edu/iaPolicies.html http://www.uic.edu/depts/las linked from: http://rites.uic.edu http://rites.uic.edu/courses.html http://rites.uic.edu/faculty.html http://rites.uic.edu/graduates.html http://rites.uic.edu/index.html http://rites.uic.edu/industry.html http://rites.uic.edu/labs.html http://rites.uic.edu/misc.html http://rites.uic.edu/projects.html http://rites.uic.edu/students.html http://www.uillinois.edu/about/policies.html linked from: http://rites.uic.edu/iaPolicies.html http://www.vpaa.uillinois.edu/policies/internet.asp?bhcp=1 linked from: http://rites.uic.edu/iaPolicies.htmlOther good sites to try your stuff on: http://logos.cs.uic.edu/reed/ http://www.uic.edu We'll be using this one as one of the evaluation examples: http://www1.cs.uic.edu
curl http://www.cs.uic.edu/ | tr ">" ">\n" | sed -r -n 's/.*<(a|img)[^>]*(href|src)=["'']?([^" >]*)["'']?.*$/\3/p'With "curl", if a page download fails it'll give you a non-zero return code. You can check it like this:
curl http://some.strange.url if (( $? != 0 )); then echo "curl returned error"; fiIn other cases, the download succeeds, but you get an HTTP error code (404 not found, for example). To see the error codes, use
curl -v http://theurl 2> /tmp/stderr.log > /tmp/thefilethis will output the HTTP header on stderr, which is then redirected to /tmp/stderr.log. In this example, stdout is also redirected, but to a different file. Watch out for relative URLs, both in HTTP redirects and in HTML pages. The can be either host-relative (starting with /), or path relative (starting without /), and may contain both '.' and '..'. They may also start with "http:" but without the "//", as illustrated on http://rites.uic.edu
./hw9 http://www1.cs.uic.edu Visited pages: 92 Links checked: 1125 Broken links (51): http://www1.cs.uic.edu/~webmaster/dls/distlect05.html linked from: http://www1.cs.uic.edu/www/home.php?audience=public http://www1.cs.uic.edu/www/home.php?audience=public&label= http://www1.cs.uic.edu/www/newsArchive.php?audience=public&label=News http:// linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct http://liu.ece.uic.edu linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct http://vienna.che.uic.edu/personalpage/ linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct http://acm.cs.uic.edu/~cpw/stump/rules.html linked from: http://www1.cs.uic.edu/www/calendar.php?audience=public&label=Calendar http://acm.eecs.uic.edu/~cpw/stump/ linked from: http://www1.cs.uic.edu/www/calendar.php?audience=public&label=Calendar http://linux.pharm.uic.edu/ linked from: http://www1.cs.uic.edu/www/calendar.php?audience=public&label=Calendar http://multimedia.ece.uic.edu/~ashfaq/ linked from: http://www1.cs.uic.edu/www/faculty.php?audience=public http://www1.cs.uic.edu/www/faculty.php?audience=public&label=Faculty http://www1.cs.uic.edu/CSweb/documents/gradmanual2002.pdf linked from: http://www1.cs.uic.edu/www/gradadmit.php?audience=public&label=Graduate http://www1.cs.uic.edu/www/gradadmit.php?audience=public&label=Graduate%20Admissions http://acm.cs.uic.edu/ linked from: http://www1.cs.uic.edu/www/links.php?audience=public&label=Links http://www1.cs.uic.edu/CSweb/speakers/andrewYao.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/aravind.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/derHorngLee.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/FransKaashoek.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/janetKoledner.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/lanceFortnow.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/langford.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/leslieLamport.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/linCai.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/moshe.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/rajJain.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/rayDeCarlo.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/riccardoPucella.php linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/www/<li><a linked from: http://www1.cs.uic.edu/www/seminars.php?audience=public&label=Seminars http://www1.cs.uic.edu/CSweb/speakers/cristea.htm linked from: http://www1.cs.uic.edu/www/news.php?audience=public&label=&ind=93 http://www1.cs.uic.edu/CSweb/speakers/chrisDing.htm linked from: http://www1.cs.uic.edu/www/news.php?audience=public&label=&ind=96 https://bannerweb.apps.uillinois.edu/uic/prospect linked from: http://www1.cs.uic.edu/www/ugradadmit.php?audience=public&label=Undergraduate%20Admissions http://www1.cs.uic.edu/www/ugradadmit.php?audience=public&label=Undergraduate http://www2new.cs.uic.edu/www/home.php?audience=public linked from: http://www1.cs.uic.edu/www/ http://www.cs.uic.edu/~abicknel/ linked from: http://cs.uic.edu/~abicknel/ http://www.cs.uic.edu/~aganti/ linked from: http://cs.uic.edu/~aganti/ http://www.cs.uic.edu/~ashoukry/ linked from: http://cs.uic.edu/~ashoukry/ http://www.cs.uic.edu/~awalters/ linked from: http://cs.uic.edu/~awalters/ http://www.cs.uic.edu/~ekhokhlo/ linked from: http://cs.uic.edu/~ekhokhlo/ http://www.cs.uic.edu/~kapichon/ linked from: http://cs.uic.edu/~kapichon/ http://www.cs.uic.edu/~pgoripar/ linked from: http://cs.uic.edu/~pgoripar/ http://www.cs.uic.edu/~rlamoren/ linked from: http://cs.uic.edu/~rlamoren/ http://www.cs.uic.edu/~sfaci/ linked from: http://cs.uic.edu/~sfaci/ http://www.cs.uic.edu/~smorris/ linked from: http://cs.uic.edu/~smorris/ http://www.cs.uic.edu/~vpritik1/ linked from: http://cs.uic.edu/~vpritik1/ http://www.ego.net/us/il/chicago/ttd/default.asp linked from: http://www1.cs.uic.edu/www/contact.php?audience=public&label=Contact http://www.evl.uic.edu/EVL/EVLERS/dana.html linked from: http://www1.cs.uic.edu/www/staff.php?audience=public&label=Staff http://www.me.uic.edu/faculty/cetinkunt.htm linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct http://www.me.uic.edu/faculty/darabi.htm linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct http://www.ohare.com/midway/home.asp linked from: http://www1.cs.uic.edu/www/contact.php?audience=public&label=Contact http://www.uic.edu/cba/cba-depts/ids/facultyprofiles/aris.htm linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct http://www.uic.edu/cba/cba-depts/ids/facultyprofiles/wxding.htm linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct http://www.uic.edu/depts/bioe/faculty/u_diwekar/index.htm linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct http://www.uic.edu/depts/bioe/faculty/y_dai/index.htm linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct http://www.uic.edu/depts/enga/currstud/studentactivities.htm linked from: http://www1.cs.uic.edu/www/ugradadmit.php?audience=public&label=Undergraduate%20Admissions http://www1.cs.uic.edu/www/ugradadmit.php?audience=public&label=Undergraduate http://www.uic.edu/depts/oae/campus_accessibility_map.html linked from: http://www1.cs.uic.edu/www/contact.php?audience=public&label=Contact http://www.uic.edu/depts/psch/ohlson-1.html linked from: http://www1.cs.uic.edu/www/adjunct.php?audience=public&label=Adjunct
Copyright 2016 The Board of Trustees of the University of Illinois.webmaster@cs.uic.edu |
WISEST Helping Women Faculty Advance Funded by NSF | ![]() | ![]() |