Conference Presentations Conference Grants Registration Lodging & Travel Keynote Speaker Session Descriptions Conference Agenda Sunday Evening Reception Sunday Training Session General Information HEDW- home
Higher Education Data Warehousing Conference 2008 - March 30 - April 1, 2008 - Blacksburg, VA
menu
Session Descriptions  

Download a PDF of the Session Descriptions (updated 2/29/2008) HEDWpresentations.pdf

Monday March 31

9:15 to 10:15 am - Break-out Sessions

Latham D-E-F
Change Data Capture Made (sort of) Easy, Mike Koontz & David Melton, University of Richmond

Data warehouse environments generally rely on source system processes to determine the deltas ( inserts, updates and deletes) within the database. Without source system generated timestamps, determining these changes rely on traditional methods such as batch compares or custom database triggers. Oracle's Change Data Capture (CDC) feature offers the ability to read the database transaction logs to determine the deltas without the heavy load processing or invasive customization normally associated with traditional methods. University of Richmond has implemented CDC with a set of custom written PL/SQL procedures and metadata tables to automate the capture, transformation and transfer of the deltas into our data warehouse. This presentation will focus on the particular CDC method Asynchronous Hotlog, the rational behind choosing this method and the various additional steps required to make the data useful for data warehousing.

Solitude
Data Warehouse and BI Approved: Now what?, Joseph Kerr, University at Buffalo

The presentation will describe an overall framework for designing, building, and deploying a data warehouse and business intelligence solution using the UB experience as an example. It will cover stakeholder and vendor management, project planning, roles and responsibilities, designing and building data models, reports and dashboards, and deployment. It will identify the most significant challenges, lessons learned, and the hidden value these projects deliver. There will be a short demo of the UB solution as well.

Assembly Hall
Data Governance at MIT, Mary Weisse, MIT

MIT has a very comprehensive infrastructure that has helped make our life a little easier when it comes to Data Governance. We are really good at managing data, authorizations, and authentications. There are still areas where we can improve (local data sources, real-time availability of data, etc). We'd like to share what we've done, where we are, and have an open discussion about where we and others are headed.

10:30 to 11:30 am - Break-out Sessions

Latham D-E-F
From Data Warehouse to Business Intelligence: The Michigan Journey, John Gohsman & Sean Mallin, University of Michigan

The University of Michigan has been building its relational data warehouse since the early 1990's. Data was exposed to end users via several data sets that could be integrated. An ERP implementation required the University to rebuild new versions of these data sets. After the ERP system was stabilized, executive officers found that we were not leveraging the vast amount of data to make better decisions. In response, Michigan under took the development of a Business Intelligence (BI) strategic plan. During the plan development, we realized the need to push our data warehouse and BI to a higher level which required a more sophisticated technical architecture and advanced data models. These advancements would improve query performance, ease access to complex information, and improve analytical capability. We believed the results would be increased user adoption and improve decision making. As part of implementing our BI strategy, we selected a delivered BI solution to augment our in-house developed solution. Campus energy is now up and demand for information is high. We will share the Michigan journey, demonstrate how a delivered solution jump-started our BI plans and demonstrate the HR product we co-developed with iStrategy.

Solitude
Open Source Reporting at Indiana University, Robert Serbent & Rebecca Gribble, Indiana University

This presentation will talk about the history of delivering decision support data at Indiana University, our current strengths, needs and deficiencies, and our future vision for the use of open source reporting coding projects and business intelligence software at IU.The Open Source project we chose to start with was the Eclipse based BIRT project. Eclipse is an open source community whose projects are focused on building an open development platforms with extensible frameworks, tools and runtimes for building, deploying and managing software systems through their lifecycle. BIRT stands for Business Intelligence Reporting Tool. It is an official Eclipse project that provides a reporting system for web applications, especially those based on Java and J2EE. BIRT consists of two major components: a report designer based on Eclipse, and a runtime engine that can be added to just about any application server.Our talk will discuss why Indiana University choose to implement BIRT, demonstrate its best features, both from an end user perspective and as a report developer, describe its architecture, and discuss how we were able to integrate it with the only outlay being a minimal number of labor hours. We will also focus on how we see open source products being used in BI at Indiana University in the future.

Assembly Hall
Building A Solid Foundation for Data Quality and Data Integration, B.K. Chen, University of Washington

As more and more data from different data sources (HR, Academics, Financial, etc) are loaded into university BI environments, data quality and data integration issues become extremely challenging. Master Data Management is a strategy taken by UW DSS to tackle these difficulties. One key component towards facilitating data integration and the creation of Master Data in UW DSS is the development of a standard time dimension.

Higher education institutions run on many different calendars. In UW's case, there are academic, federal, state fiscal calendars, etc. Operational systems typically have defined and built their own ways of recording event times. The users of data, however, often require reporting across different data sources and business domains. This presentation will discuss how this standard time dimension, as one of the basic steps of building master data, was designed, developed, and implemented. I will use that case to explore the implication of Master Data Management in higher education setting.

11:30 am to 12:45 pm - Latham A-B - Lunch

1 to 2 pm - Break-out Sessions

Latham D-E-F
Empowering Institutional Change Across Virginia's Community Colleges Through Data: Moving Toward a Comprehensive Data Warehouse Through Two Datamarts, Bruce Bartek, Tim Jones, Susan Wood, Monty Sullivan, Virginia Community College System

The system office of Virginia's 23 community colleges is leading a comprehensive data warehouse project. Providing impetus for this project was the conversion four years ago of the legacy student information system to a PeopleSoft/Oracle SIS at which time most of the standard reporting functionality built up around the legacy system was lost. Additionally, the introduction of a new SIS with increased functionality into a multi-institution environment created many data quality issues. This session will describe the progress to-date as the system has moved from limited access and inconsistent data towards an integrated portal. A curriculum datamart focusing on enrollment, awards, and program productivity data was developed and rolled out to college users at the "researcher" and "consumer" levels. Nearing completion and rollout is a second datamart focusing on retention that enables the user to define and track a cohort on a number of analysis measures. These two datamarts represent proofs-of-concept whose use by the colleges will inform a strategic value assessment. The full build-out of the data warehouse will incorporate outcomes from the strategic value assessment. In this session, presenters will describe the evolution of the project, involvement of key stakeholders, challenges in development and implementation, and lessons learned.

Solitude
Tackling the Challenge of Integrating Locally Developed Applications with University Data, Lance Tucker & Josh Millinger, Boston College

The proliferation of desktop database technologies over the last twenty years has resulted in a myriad of data quality and security issues on campus. This presentation will focus on Boston College' strategy of using Oracle's Application Express in conjunction with our Oracle Data Warehouse to deliver safe and efficient information delivery solutions to our customers. Administrative groups have been able to build their own local applications without IT assistance. Applications like FileMaker Pro spread quickly. Users became proficient with this technology in a short period of time and built solutions without any IT costs. Many useful applications evolved that serviced the University. However, key problems also developed around data quality, data security, and data management. This presentation will focus on Boston College's effort to address the two largest issues these local applications presented:

  • Multiple Systems of Record
  • Data Security - authentication and authorization

The presentation will review the architecture we chose to address these issues. We will go into detail on the data delivery and security mechanisms used to provide a more robust solution. We will also explore the policies required, business processes, and associated costs to support this solution. A working application will be demonstrated using this approach.

Assembly Hall
Lessons in Faculty Activity: From Politics to Analytics, Aaron Walz & Beth Ladd, University of Illinois

One common challenge for academic administrators is understanding the range of activities faculty are involved in. This information is essential context for evaluating research productivity, making promotion and tenure decisions, determining where to best allocate scarce resources, and responding to surveys. However, it often remains elusive because it requires integrating data spanning finance, pre-awards, grants, payroll, human resources, and student enrollment. The business questions that must be answered to create individual puzzle pieces, as well as the complex logic required to connect them, combined with culture, organizational and political factors create real challenges when trying to establish common business rules and definitions around such questions as "who are my faculty?" This presentation will describe a project at the University of Illinois that designed and built an OLAP cube to support Faculty Activity Analysis. It will begin by covering the many political and organizational challenges encountered in starting up and sustaining such a project and how they were addressed. It will also describe the overall approach, design challenges, business questions and how they were resolved. Finally, it will demonstrate the end result and describe the reaction from the intended customers.

2:15 to 3:15 pm - Break-out Sessions

Latham D-E-F
Building Next Generation Data Marts at Cornell, Jeff Christen, James Singleton & Yiorgos Marathias, Cornell University

Cornell University has chosen and is implementing a path towards an Enterprise Data Warehousing solution. This strategy involves:

  • Using the Kimball Methodology to manage the project lifecycle along with developing Dimensional Models (STAR Schemas) for new Data Marts;
  • Utilizing the mature infrastructure and resource with Cornell Information Technology;
  • Utilizing both Internal Resources and an External Consultant Organization for new data marts and when re-engineering existing data stores;
  • Delivering data marts in support of new Operational Application roll-outs. Specific areas that will be expanded upon within the presentation include:
  • Business User Requirements definition and translation;
  • Project Planning, Scope and Commitment;
  • Data Warehousing Development Methodology;
  • Cornell Universities Data Warehousing Technical Architecture;
  • Utilizing Source Applications / Legacy and Local Data.

Solitude
The Virtues of Cross-Training, Steve Grantham, Boise State University

At Boise State University we are in the process of implementing the iStrategy Data Warehouse for Student data. After having worked in the Enrollment Services area on our PeopleSoft implementation for about 8 years, I moved over to our IR department to lead the Data Warehouse implementation. One thing I have found in making this transition is that many of the precise details - some might say idiosyncrasies - of the business practices of offices like Admissions and the Registrar have not necessarily been well known or understood on the IR side of the house. And my transition has driven home to me the point that such offices naturally design and implement their business practices in a way that optimizes transactional processing, not necessarily reporting. I believe that my familiarity with many of the details of the transactional system has made it easier for me to analyze and understand the logic needed to populate the data warehouse correctly. This presentation will give several specific examples intended to illustrate this general principle and highlight the importance of communication between the transactional and reporting units of a university.

Assembly Hall
Cooperative Data Quality Checking and Improvement, Dylan Cooper, University of Arizona

Bad data, or the suspicion of it, is a major concern for reporting and decision support systems. At the University of Arizona, there has long been anecdotal evidence of data quality issues. This has led to problems for data analysts and uncertainty about the reliability of the data in general. During the past two years, the Information Warehouse Office has partnered with the users of the personnel and student information systems to find and correct data quality errors. We designed and implemented a system that checks the data as soon as it is loaded into the operational data store and displays any errors found in a web site. The administrative applications staff uses the web site to correct the data in their systems. This process has allowed us to find and correct thousands of cross-system inconsistencies, missing values, miscoded attributes, invalid dates, etc. Additionally it has allowed us to measure the quality of our data in ways we were not able to before. Happily we discovered that it is better than some people feared. This talk will outline the technological and sociological approach we have taken, the problems we have faced, the successes we have had, and our future plans.

3:15 to 3:45 pm - Afternoon Break

4:00 to 5:00 pm - Break-out Sessions

Latham D-E-F
Tips and Tricks with SQL Queries, Michael Wonderlich, University of Illinois

This presentation will focus on intermediate and advanced techniques in SQL writing. Techniques for analytical processing and advanced relationship combinations will be reviewed. Also tips for writing more efficient SQL will be provided.

Solitude
Enterprise Reporting at the University of Delaware, Kat Collison & Karen DeMonte, University of Delaware

In the fall of 2004, the University of Delaware began converting its Student Administration system to PeopleSoft. . Institutional Research was asked to lead the reporting effort during the implementation phase using a new reporting system, Cognos. PeopleSoft, like other transactional data base systems, is designed for processing information, not reporting. With a system no one knew and a database structure hostile to reporting, progress was slow. A data warehouse was the solution for working with conflicting data structures (the PS system is a transactional database and Cognos prefers dimensional data structures). Institutional Research is now in partnership with Information Technology to design and manage the University of Delaware Enterprise Warehouse - UDEW IT. This presentation will delve into the design and content of the data warehouse from both a functional and technical point of view as well as review its Business Intelligence solution. Emphasis will be on: 1) Gathering specific reporting requirements and business rules; 2) Building the Student Data Mart using Cognos ETL tool - Data Manager, and Framework Manager, 3).Demo of generating reports using Cognos ReportNet and the Student Data Mart.

Assembly Hall
Securing Data in the University of Washington Enterprise Data Warehouse, Anja Canfield- Budde and Glenn Pittenger, University of Washington

The goal of this design is to implement row level and column level security on specific SQL Server objects based on Roles. Based on certain parameters such as organization type, users are classified into a finite set of Roles, which ultimately govern access to sensitive information like Social Security Numbers, Date of Birth, Race or Ethnicity. Within a Role, users receive Span of Control values according to certain parameters, such as Payroll Unit Code (PUC). The information about Roles and Span-of-Control is captured in the campus-wide authentication system ASTRA and will populate to the Data Access Control (DAC) databases from there.

Our talk will introduce the different players in the game: the Roles & Access Matrix, ASTRA, Group Directory Services, Windows Active Directory, and Microsoft Reporting and Analysis Services (SSRS/SSAS). By walking through the design of the SQL server components of the Data Access Control (DAC) schema, our talk will then focus on how we apply security at the data level. We will describe relevant user relationship objects, show entity relationship diagrams, and explain the logical types of tables in the Data Access Control schema.

5:00 to 6:00 pm - Latham A-B - HEDW Business Meeting

April 1 (Tuesday)

7:00 am - Foyer - Continuous Break

8:00 to 9:00 am - Break-out Sessions

Latham D-E-F
Dash and go, working with Data Warehousing…, Jacqueline Nottingham & Pamela McAlexander, Virginia Tech

Because of the variety of customers it must work with, Admissions is considered multifaceted and complex. At Virginia Tech, we have developed business processes for focusing our attention on communicating admissions information internally and to external constituents at the University. This session will provide background on the need for developing a more comprehensive plan for information distribution and establishing stronger relationships with our colleagues, utilizing a data dashboard. We will discuss how Virginia Tech has integrated the use of a Dashboard to create a streamlined process for informing program areas and administrative personnel across campus of needed information. The ability to provide more efficient and higher quality service will be covered as well. This session would be helpful to anyone in the Recruitment or Admission arena. We believe this forum will provide an opportunity for others to learn from our experiences and to help develop their own plans for working smarter with their constituents. It will also provide a structured environment to share and discuss new ideas with others who face similar issues.

Solitude
The University of Texas' project Information Quest Working Model of Data Governance, Jarrett Cole & Darin Mattke, University of Texas

Project IQ is able to deliver exceptional reporting-related products via the implementation of a highly sophisticated and integrated functional model of data governance. UT's team of Data Architect, Data Quality Analyst and BI Developers will diagram and explicate the intricacies of the fine-tuned mechanisms utilized to achieve the delivery of one version of "Truth" University leaders have come to depend on in making the most informed decisions possible. Examples of how the model operates will highlight the roles of Data Stewards, the ETL Process, DBAs, Data Quality Analysts, BI Development, and our Management & Support section. This presentation will also illustrate the procedural processes we define as Logistics, Standards, Deliverables and communication. These areas deal with data staging, data availability, establishing data definitions and ensuring data quality. A successful implementation is communicated via an iterative process of requirement gathering, developing and testing products, and otherwise interfacing with our community of developers and consumers. The tangible outputs are in the form of reports, cubes and metadata repositories. After attending this session, participants will have a road map to follow for a successful data governance model.

Assembly Hall
From the Data Warehouse and EIS to Business Intelligence, the evolution of Penn State, Marta Miguel & Yvonne Riley, Penn State

The presentation will focus on Penn State's evolution from the Data Warehouse and Enterprise Information System (EIS) (Cognos 7) to a University-wide Business Intelligence strategic vision. It will include an overview of the current DW and EIS systems, current governance and stewardship model, and strengths and weaknesses. Both of these systems currently have over 1000 very active users. Then it will introduce the BI Strategic Vision, how we arrived to that vision, how we received leadership approval, and our approach to implementing it. The strategic vision states that information is quickly becoming a strategic asset and that for Penn State to be able to leverage that asset it needs to consider three factors: Governance and Policy, organizational structures, and software, hardware and data infrastructures. Within the strategic vision each of these three factors needs to be re-assessed. The presentation will explain why information is becoming a strategic asset as well as describe the proposed organizational model and reference technical architecture. This presentation will be useful for people trying to justify a Business Intelligence implementation at their universities or for someone trying to create a university wide strategy for BI.

9:15 to 10:15 am - Break-out Sessions

Latham D-E-F
Association of American Universities Data Exchange DW, John Scanlon & Mary Weisse, MIT

MIT has designed, built, and currently hosts a Data Warehouse for The Association of American Universities Data Exchange (AAUDE). AAUDE is a public service organization whose purpose is to improve the quality and usability of information about higher education. Their membership is comprised of AAU institutions that support this purpose and participate in the exchange of data/information to support decision-making at their institution. This exchange is facilitated by the DW that resides at MIT. MIT would be interested in sharing the DW design, the development process, and how we support our customers at these Institutions.

Solitude
Expanding Data Warehouse Services: Opportunities and Risks, A Panel Discussion, Suneetha Vaitheswaran, University of Chicago; Andrea Ballinger, University of Illinois; Ted Bross, Princeton University; Ora Fish, Rensselaer Polytechnic Institute; Jeffrey Glatstein, University of Massachusetts

A panel of colleagues from higher education institutions who have deployed enterprise data warehouse solutions will discuss their alternate perspectives regarding the demand, benefits, challenges, and decision points in expanding data warehouse strategy and reach. Additional services which may be discussed are using the data warehouse as a feed source of data to other enterprise systems, increasing the load or refresh frequency to near real-time, distributing standard report development beyond the central development group, etc.

Assembly Hall
Developing & Using Key Performance Indicators (KPI) to Provide Institutional Strategic Direction, Jack Mahoney & Jeff Stark, Rensselaer Polytechnic Institute

Higher education institutions, like all industries, have numerous measures that provide insight into areas requiring attention. The challenge in any organizational setting is to clearly identify an agreed upon set of business metrics that represents the vital areas of the organization. Rensselaer Polytechnic Institute will share their experience in developing their Key Performance Indicators and Faculty Workload applications, both of which provide essential tools to focus on areas defined by the university as critical and strategic. The discussion will focus on the process Rensselaer used to define a set of business metrics, why these particular metrics were selected, how they were structured in an easy-to-use dashboard, and where they are being used to produce actionable results. Coupled with the presentation will be a demonstration of Rensselaer's Performance Metric and Faculty Workload Dashboards.

10:30 to 11:30 am - Break-out Sessions

Latham D-E-F
Using nVision and PeopleSoft Security with Kimball-style Data Warehouse, Craig Leslie, DePaul University

nVision is a widely used MSExcel based BI tool among PeopleSoft users, especially among financially oriented applications. Furthermore, nVision can combine the results of multiple queries and complex data manipulations on a single page report, taking advantage of the full range of Microsoft presentation formatting. nVision has suffered from slowness in the PSQuery layer when extracting data from complex PSoft transaction tables. But when a highly tuned, dimensional DW is used as the source, performance increased dramatically while retaining the reporting advantages of nVision.

Solitude
Faculty Workload Analysis, A Panel Discussion, Emily Thomas, Stony Brook University; Robert Duniway, Seattle University; Richard Howard, University of Minnesota; Jack Mahoney, Rensselaer Polytechnic Institute; Aaron Walz, University of Illinois

Measuring faculty workload is important to university management and complicated. The panelists in this session will lead a discussion of the analytic, technical, and political issues involved. Questions include: How can data on teaching and non-teaching workload be captured? What metrics should be reported? What governance issues arise?

Assembly Hall
Establishing a Business Intelligence Competency Center (BICC), Michael Wonderlich, University of Illinois

In today's industry the hot topic is a Business Intelligence Competency Center (BICC). Business Intelligence is becoming a critical piece in the success of a data warehouse environment. A BICC provides an organized method to address the staffing, process, BI culture and infrastructure needs of your university. In this session we will

  • Define the components of a BICC.
  • Examine how existing services match to the components of a BICC.
  • Review how to assess the gaps in our organizations
  • Discuss ideas how to fill those gaps.

11:30 am to 12:45 pm - Latham A-B - Lunch

1:00 to 2:00 pm - Break-out Sessions

Latham D-E-F
Birds of a Feather---Sessions to be determined by attendees

Solitude
How not to manage a data warehouse development project, Bob Duniway, Seattle University

Seattle U. demonstrated the value of reporting against a central data warehouse in 2003, and we wanted to expand our efforts. Although we are going operational with our new and improved dimensional data warehouse in March, I would be lying if I said the process of developing this solution was well managed by the university. This is a Jesuit university, so lying is not an option. Instead, I'll share the truth of our process, what went right, what went wrong, and how we recovered.


 
HEDW-Home

Virginia Tech - Invent the Future
 

General Information | Sunday Training Session | Sunday Evening Reception | Conference Agenda | Session Descriptions
Keynote Speaker | Lodging & Travel | Registration | Conference Grants | Conference Presentations


HEDW - Home Page |Continuing & Professional Education @ Virginia Tech