Thanks for your interest, but I'm currently not looking for a job.

Arturo Crespo

1213 Estrada Terrace
Sunnyvale, California 94086

Phone: (650) 224-0652
arturo@crespoweb.com
http://arturo.crespoweb.com

Education

Ph.D. Computer Science, Stanford University.
Dissertation: Digital Library Repositories.
Advisor: Professor Hector Garcia-Molina
1996 - 2002
M.S. Computer Science, Stanford University.
GPA: 4.0/4.0
1994 - 1996
M.B.A. Instituto de Estudios Superiores de Administracion
GPA and rank: 3.92/4.0, 1st of 54
1990 - 1992
B.S. Computer Engineer, Universidad Simon Bolivar.
GPA and rank: 3.9/4.0, 1st of 216
1985 - 1990

Work Experience

  • Staff Software Engineer, Google Inc. 2003 -
    • Product Search (Froogle)
    • Google Checkout: as the first engineer in the project, conceived, designed, and form the team. Google Checkout launched in June 2006.
    • Google Billing and payments: responsible for the day-to-day of the billing of AdWords revenue. Automatized the system, significantly cutting down the time that the process took.
  • Research Assistant, Stanford University 1996 - 2002
    Advisor: Professor Hector Garcia-Molina
    • Led the design and prototype implementation of the Stanford Archival Repositories Protocol.
    • Researched and evaluated the complexity of algorithms for efficient handling of query subscription in a multicast environment.
    • Co-designed SDLIP, the Simple Digital Library Interoperation Protocol.
    • Investigated algorithms for fast reconciliation of database replicas with independent updates.
  • Research Intern, Xerox PARC Summer 1996
    Mentors: Dr. Eric A. Bier and Dr. Bay-Wei Chang
    • Led the design of the Meteor Shower Architecture. An architecture that allows responsive interaction for a large web application.
    • Developed WebWriter II Editor, a direct manipulation HTML editor that runs in a web browser, as a test of the flexibility and efficiency of the Meteor Shower Architecture.
  • Research Assistant, Stanford University 1995 - 1996
    Advisor: Dr. Andreas Paepcke
    • Developed a complex translator between the Dialog database and the Stanford Digital Library protocol. This translator offered unlimited virtual sessions by multiplexing over a small set of Dialog accounts.
  • Research Intern, Xerox PARC Summer 1995
    Mentor: Dr. Eric A. Bier
    • Led the design and implementation of WebWriter, an integrated system for constructing Web applications that supports the creation of Web pages by non-programmers.
  • Research Assistant, Stanford University 1994
    Advisor: Professor Jeff Ullman
    • Development of a Distributed Computer Assisted Instruction (CAI) system.
  • Assistant Brand Manager, Procter & Gamble 1992 - 1994
    • Led the development of new product initiatives.
    • Analyzed and evaluated markets, clients and competition.
  • System Analyst, MARAVEN 1989 - 1990
    • Designed and developed an automatic system for detection of poor performance conditions in MVS, and an expert system to suggest actions to correct them.
    • Developed of a semi-automatic system for the processing of oil well registers. Work involved re-programming in Assembler the control ship of a scanner, development of a graphic user interface, and programming of an artificial intelligence module.

Personal

Hobbies: hiking, backpacking and climbing. I've climbed all but one of the California peaks above 14000 feet.
President of the Stanford Outing Club (1996-2004)
Citizenship: Venezuela and Spain. US Permanent Resident.

Teaching Experience

  • Teaching Assistant, Stanford University Spring 1998
    Distributed Database Systems.
  • Instructor, Stanford University Summer 1997
    Database Systems Principles.
  • Teaching Assistant, IESA 1991 - 1992
    Math for Managers and Quantitative Techniques I.
  • Teaching Assistant, Universidad Simon Bolivar 1988 - 1990
    Logic; Logic Design; Introduction to Computer Science; Algorithms, Graphs and Structures; Computer Architecture and Assembly Language.

Dissertation: Digital Library Repositories

Current libraries use an assortment of uncoordinated and unreliable techniques for storing and managing their information. Some of their collections are in paper form, which can decay and can be lost. Digital information is stored in file systems, Web sites, database systems, and information retrieval (IR) systems. Typically the systems are good for some functions but not others. For example, a system may have good text search capabilities, but not reliable storage. Often information is stored in legacy systems that are hard to integrate with modern ones. Across these diverse systems, information is stored in various incompatible formats, and accessed using different command languages. My goal is to design and build a modern Digital Library Repository (DLR). Such a repository stores and manages the digital objects that constitute the library. My goal is not to replace existing systems (e.g., database systems, IR systems), but rather to allow existing and future systems to work together in managing and preserving an interrelated collection of digital objects (and their versions) in the simplest and most dependable way. Like any modern information repository, a DLR must be secure, reliable, and scalable. In addition, a DLR has the following combined key requirements that distinguish it from other repositories. First, digital objects (e.g., documents, technical reports, movies) must be preserved indefinitely, as technologies, file formats, and organizations evolve. Second, DLRs will be formed by a confederation of independent organizations. Third, published digital objects have a historical nature, so changes should not be done in-place; instead, they should be recorded in versions.

PUBLICATIONS

  • "Query Merging: Improving Query Subscription Processing in a Multicast Environment." Arturo Crespo, Orkut Buyukkokten, Hector Garcia-Molina. Accepted for publication in the IEEE Transactions on Knowledge and Data Engineering (TKDE).
  • "Routing Indices For Peer-to-Peer Systems." Arturo Crespo, Hector Garcia-Molina. In Proceedings of the International Conference on Distributed Computing Systems (ICDCS). July 2002.
  • "Cost-Driven Design for Archival Repositories". Arturo Crespo, Hector Garcia-Molina. In Proceedings of the First Joint Conference on Digital Libraries (JCDL). June, 2001.
  • "Modeling Archival Repositories." Arturo Crespo, Hector Garcia-Molina. In Proceedings of the Fourth European Conference on Digiatl Libraries (ECDL). September, 2000.
  • "Implementing a reliable digital object archive." Brian Cooper, Arturo Crespo, Hector Garcia-Molina. In Proceedings of the Fourth European Conference on Digiatl Libraries (ECDL). September, 2000.
  • "Efficient Query Subscription Processing in a Multicast Environment." Arturo Crespo, Orkut Buyukkokten, and Hector Garcia-Molina. In Proceedings of the 16th International Conference on Data Engineering (ICDE). San Diego, CA, USA, February 29 - March 3, 2000.
  • "Archival Storage for Digital Libraries." Arturo Crespo and Hector Garcia-Molina. In Proceedings of the Third ACM Conference on Digital Libraries, June 1998.
  • "Awareness Services for Digital Libraries." Arturo Crespo and Hector Garcia-Molina. In Research and advanced technology for digital libraries: First European conference (ECDL'97), September 1997.
  • "Extracting Semistructured Information from the Web." Joachim Hammer, Hector Garcia-Molina, Junghoo Cho, Arturo Crespo, Rohan Aranha. In Proceedings of the Workshop on Management of Semistructured Data, May 1997.
  • "A Standard Textual Interchange Format for the Object Exchange Model (OEM)." Roy Goldman, Sudarshan Chawathe, Arturo Crespo, Jason McHugh. Stanford Technical Report. Number SIDL-WP-1997-0061. March, 1997.
  • "Responsive Interaction for a Large Web Application: The Meteor Shower Architecture in the WebWriter II Editor." Arturo Crespo, Bay-Wei Chang, and Eric A. Bier. In Proceedings of the Sixth International World Wide Web Conference, April, 1997.
  • "WebWriter: a browser-based editor for constructing Web applications." Arturo Crespo, Eric A. Bier. In Proceedings of the Fifth International World Wide Web Conference, May 1996.

REFERENCES

Available upon request.