Fall 2003 - CS 553 (Call no 35366) Distributed Systems
Instructor:
Ajay Kshemkalyani
Email:
ajayk@cs.uic.edu
Class meeting times: T,R 12:30-13:45pm
We may need to set up a few extra class meetings to make up for any
of the regular meetings we miss due to my travel.
Room: SES 230
Office Hours: (T,R 2:00PM - 3:00PM)@915 SEO
URL: http://www.cs.uic.edu/~ajayk/c553fa03.html
-
(main text)
Advanced Concepts in Operating Systems by Singhal and Shivaratri, McGraw Hill, 1994
-
Selected papers and other material from the literature will be posted on the
web or distributed in class.
-
Additional reference texts
- Distributed Systems, by Sape Mullender, Addison-Wesley, 1993
- Distributed Operating Systems and Algorithms, by Chow and Johnson, Addison-Wesley, 1997.
- Distributed Systems Concepts and Design, by
Coulouris, Dollimore, and Kindberg, Addison-Wesley, 2001.
This course focuses on theoretical principles.
You will not be programming in this course.
- Models: synchronous/asynchronous; shared memory/message-passing
- Global states and snapshots; time models
- Group communication, managing group views
- State predicates
- Mutual exclusion, deadlock detection, leader election
- Distributed shared memory - coherence, models
- Checkpointing, rollback recovery; distributed debugging
- Agreement and consensus
- Peer-to-peer computing
- Multicast
- other current topics
The following is only a TENTATIVE breakup of the evaluation scheme and is
subject to changes as the course progresses.
- Term paper (25%); topic of your choice
[12% for original research content;
7% for breadth and depth of literature survey;
6% for organization and presentation]
- Class presentation, critiques, & attendance of other presentations (15%)
- Test 1 (30%). All or a part of this test may be take-home (depending on class-size).
- Test 2 (30%). All or a part of this test may be take-home (depending on class size).
The full scale of grades ("A" - "E") will be used.
Handwritten submissions, except for the exam(s), will not be accepted.
Algorithm analysis and design,
Operating systems; or permission of the instructor.
TENTATIVE course progress chart:
- Week 1.1: Introduction, models
- Week 1.2: models, failure models, synchronous/asynchronous,
blocking/nonblocking communication
- Week 2.1: elementary algorithms
- Week 2.2: Global states; Chandy-Lamport algorithm; read this survey.
- Week 3.1: time - scalar and vector clocks; partial orders; clock synchronization
- Week 3.2: predicates - possibly and definitely; detection algos
- Week 4.1: Causal order; total order
- Week 4.2: fault-tolerant protocols; layered protocol design
- Week 5.1: termination detection; message-passing mutual exclusion
- Week 5.2: message-passing mutual exclusion
- Week 6.1: message-passing mutual exclusion
- Week 6.2: agreement and consensus
- Week 7.1: Test 1 (October 7) in SES 230.
- Week 7.2: agreement and consensus
- Week 8.1: agreement and consensus
- Week 8.2: Distributed shared memory
- Week 9.1: Distributed shared memory
- Week 9.2: load balancing; fault-tolerance
- Week 10.1: fault-tolerance
- Week 10.2: checkpointing and rollback recovery
- Week 11.1: checkpointing and rollback recovery. Take-home test will be distributed.
- Week 11.2: In-class Test 2 (November 6) in SES 230.
;
also Take-home test due
- Week 12.1 (Nov 11): Presentation 2 by US,PS:
Coordinated placement and replacement for large-scale distributed caches,
Korupolu, M.R.; Dahlin, M., Page(s): 1317- 1329, IEEE TKDE, Nov/Dec. 2002.
- Week 12.2 (Nov 13): Presentation 3 by AB,MB:
Chord: a scalable peer-to-peer lookup protocol for internet applications.
Ion Stoica, Robert Morris, David Liben-Nowell, David Karger, M. Frans Kaashoek, Frank Dabek, Hari Balakrishnan,
IEEE/ACM Transactions on Networking 11(1): 17-32 (2003)
- Week 13.1 (Nov 18): Presentation 4 by SK,AS:
A Scalable Content-Addressable Network, Sylvia Ratnasamy, Paul Francis, Mark
Handley, Richard Karp, and Scott Shenker, ACM SIGCOMM 2001.
- Week 13.15 (Nov 19, 8:00am - 9:15am in SEO 1000): Presentation 1 by JS,AW:
Internet Time Synchronization: the Network Time Protocol, D. Mills. IEEE Trans. on Communications, 39(10): 1482-1493, Oct 1991.
- Week 13.2 (Nov 20): Presentation 5 by WK,MS:
Deno: Decentralized p2p object replication system for weakly connected environments,
Cetintemel, U.; Keleher, P.J.; Bhattacharjee, B.; Franklin, M.J.,
IEEE Transactions on Computers, 52(7), July 2003, pages 943-959.
- Week 14.1 (Nov 25): Presentation 6 by LM,CP:
Ad hoc On-Demand Distance Vector Routing,
Proc. 2nd IEEE Workshop on Mobile Computing Systems and Applications, 1999, pp. 90-100.
- Week 14.2: -- -- -- (Thanksgiving, Nov 27)
- Week 15.1 (Dec 2): Presentation 7 by LX,YH:
Distributed Construction of Connected Dominating Set in Wireless Ad hoc Networks.
21st Annual Joint Conference
of the IEEE Computer and Communications(INFOCOM), 2002.
- Week 15.2 (Dec 4): Presentation 8 by BW,WS: Web Prefetching: Costs, Benefits, and Performance,
International Workshop on Web Content Caching and Distribution (WCW), August
2002.
Term paper due at 5:00pm on December 4
Attendance is not compulsory. However, if you miss class, it is your
responsibility to find out what was announced and what was covered.
The course format will be in two parts.
- For the first part, the instructor will teach.
- The second part involves active student
participation and is planned as follows.
Each team of 2-3 students will be required to present 1 topic.
A paper will be assigned at least 3 weeks before its presentation date.
Each student has to read and critique EACH paper, INCLUDING the ones his/her
team is presenting, and submit a critique in class on the presentation day.
If several papers are being discussed in a class, each of them should have a
separate critique. Each critique should be anywhere between 1 to 2 printed
pages (short and crisp) and should have the following format.
- In your words, summary of the main contributions of the paper
- Your evaluation of the contributions to advancing science and the
state-of-the-art of the topic
- State any deficiencies that you found in the paper
- Provide directions for future improvements
- List at least three of your most intelligent questions you would ask the
author, or in this case, the presenters. The questions should not be
"can you explain xyz on page 123?"
The following are specific topics. For each, suggested papers for presentation, or suggested papers or suggested URLs for starting points to identify appropriate papers for presentation are given.
If you are not too thrilled with any of topics 3-8, feel free to suggest me some topics, based on some paper from ICDCS TPDS, JPDC etc. within the past two years.
It MUST have algorithmic content.
Please let me know which paper you would like to present. I encourage you to meet me in person during weeks 5-8 to discuss what you would be presenting. And continue meetings during the semester on a regular basis.
-
Physical clock synchronization.
Network Time Protocol (Version 3) Specification, Implementation and Analysis
David L. Mills. RFC 1305, 1992.
David Mill's home page for NTP. See IEEE TComm , ref [27] in particular.
Improved
algorithms for synchronizing computer network clocks, David Mills.
Presenters: AW, JC
-
Replica placement and caching on the Internet
Coordinated placement and replacement for large-scale distributed caches,
Korupolu, M.R.; Dahlin, M., Page(s): 1317- 1329, IEEE TKDE, Nov/Dec. 2002.
Mike's web page
Amin's web page
Presenters: US, PS
-
Algorithmic Challenges in Peer-to-Peer Systems. What are the topic-specific problems that pose new algorithm design challenges?
Chord: a scalable peer-to-peer lookup protocol for internet applications.
Ion Stoica, Robert Morris, David Liben-Nowell, David Karger, M. Frans Kaashoek, Frank Dabek, Hari Balakrishnan,
IEEE/ACM Transactions on Networking 11(1): 17-32 (2003)
IPTPS Workshops
Presenters: AB, MB
- More Peer-to-peer. Topology Overlays - Algorithmic Challenges.
Building topology-aware overlays using global soft-state,
Zhichen Xu; Chunqiang Tang; Zheng Zhang, IEEE ICDCS, May 2003, Page(s): 500- 508.
Presenters: SK, AS
- More Peer-to-peer. Topology Overlays - Algorithmic Challenges.
Decentralized p2p object replication system for weakly connected environments,
Cetintemel, U.; Keleher, P.J.; Bhattacharjee, B.; Franklin, M.J.,
IEEE Transactions on Computers, 52(7), July 2003, pages 943-959.
Presenters: WK, MS
- Mobile ad-hoc networks: What are the algorithmic challenges? Routing ....
AODV Routing algorithm
Presenters: LM, CP
- Mobile ad-hoc networks: What are the algorithmic challenges? Routing ....
Virtual backbone for multicast in ad-hoc networks
Presenters: LX, YH
- Web Content Caching and Distribution: What are the algorithmic challenges?
A course on Web Algorithms at ETH, Zurich
International Workshops on
Web Content Caching and Distribution (WCW),
11th World Wide Web Conference (WWW), 2002,
12th World Wide Web Conference (WWW), 2003,
Prashant's pubs
Presenters: BW, WS
Auxiliary information
Literature sources:
The following is a listing of journals and conferences
that you may find useful in your literature search for your term paper.
-
Inter. Conference on Distributed Computing Systems (ICDCS)
-
IEEE Transactions on parallel and distributed systems (TPDS)
-
Journal of Parallel and Distributed Computing (JPDC)
-
ACM Symp. on Principles of Distributed Computing (PODC)
-
IEEE Symp. on Parallel and Distributed Processing (SPDP)
-
Distributed Computing
-
Journal of the ACM
-
Journal of Algorithms
-
Workshop on Distributed Algorithms (WDAG)
-
SIAM Journal on Computing