
Top Ten Issues For Implementing The NERC CIP Reliability
Standard
The North American Electric Reliability Corporation 's (NERC) CIP Reliability Standard is the most comprehensive and pervasive standard among all of the 100+ NERC standards in force today. NERC was designated as this "electricity reliability organization" (ERO) by FERC on July 20, 2006. The standard, which addresses asset identification, security management controls, personnel risk assessment and training, cyber and physical perimeter security, systems security management, and incident reporting and recovery, was developed to ensure all relevant electric utility entities identify and protect Critical Cyber Assets that control or could impact the reliability of the Bulk Electric System. While a number of implementation dates for the standard vary among participants and infrastructure types, the June 30, 2009 date is critical for a significant number of entities. Subsequent to this date, staff from the Regional Entities will initiate their audits of the compliant Security Programs.
The path towards ensuring a compliant Security Program has been appropriately designed and can be sustained is proving to be a challenge. This article provides insight to the top ten issues that frequently surface during the course of implementing the requisite policies, processes, and evidentiary documentation requirements. The top ten issues are illustrated in Figure 1 and discussed in the paragraphs that follow.
Executive Engagement
One of the most strategic issues that can impact the
design and implementation of a NERC CIP Reliability Standard
compliant Security Program is the need for executive engagement at
the top of the organization. The need for this participation is
based upon:
-
The resultant policies, processes, and documentation required to implement a compliant Security Program impact multiple organizations.
-
Direction and support needs to come from a position that has the authority to provide an appropriate governance structure in order to ensure adequate participation and a collaborative environment resides within the requisite organizations.
-
Considerable funding is required to design, implement, and sustain the compliant
-
Security Program; appropriate budgeting at the executive level addressing the necessary vision across the corporation involving multiple organizations is needed to support the multi faceted program.
In today's environment, a number of competing strategic issues are continually being assessed at the corporate level. These issues include, but not limited to, decreasing revenues during the current economic turmoil, smart grid initiatives, "green" related environmental issues, availability of capital, acquisition and resurgence of nuclear energy, aging work force, generation resources, transmission requirements, and the general reliability of the electric grid. With these issues on the forefront, the attention needed to mitigate the risk of non‐compliance with the NERC CIP Reliability Standard is often lost in the shuffle. In order to ensure compliance is sustained, an executive oversight committee (where issues can be surfaced) coupled with a cross functional team to support resolution of issues and maturing of the Security Program is a valuable and tactical management approach.
Risks of
Non-Compliance
Non‐compliance can result
from a number of events culminating in substantial financial and
social impact on the corporate entity. From an enforcement
perspective, based on their audit findings of violations of the
requirements of the standard, the Regional
Entitiesshall, in accordance with published
guidelines, determine and levy monetary penalties and nonmonetary
sanctions and remedial actions. Assessed penalties
will be based on the Violation Risk Factors, Violation
Severity Levels, repetitive violations, ability to
pay, intentional violation, concealment of violation,
self‐disclosure and voluntary corrective
action, and other extenuating circumstances.
Based on the maximum limitation penalties of up to $1 million per day per event, monetary penalties can be very significant. NERC is not constrained from assessing the same penalty amount for multiple violations that occur on the same day. Many of the proposed Violation Severity Levels have been designed such that significantly lower penalties will be assessed for those violations that are small in number or brought into compliance in a short period of time. Companies can reduce the amount of potential penalties by ensuring that non‐compliance is identified, minimized, and resolved as soon as possible.
Non‐monetary penalties can also be levied with respect to limitations in activities, functions, operations, and other appropriate sanctions. Sanctions may also impact the corporate posture with negative press. The worst nightmare is being on the front page in a story that illuminates a significant violation or event that caused an outage of the bulk electric system. In any event, there are significant non‐monetary penalties that need to be evaluated in assessing the risks associated with non‐compliance at any level.
As a result, the need for a formal risk management program is surfacing in many entities that have not developed the same. The directions being pursued by the regulatory authorities in this regard indicate that managing risks will be high on the agenda. Integrating risk mitigating provisions into the operations and underlying compliant Security Programs (including all NERC and other relevant standards) from a corporate perspective is a Critical Success Factor and vital business practice.
Adequacy of Resources
The resources (staff and funding) required to
design, implement and sustain a compliant Security Program is quite
significant and represents an important and visible budget item.
Efforts to design a compliant Security Program have typically
required between 5,000 to 10,000 man hours exclusive of user
training, personnel risk assessments, and expenditures required for
cyber and physical security infrastructure upgrades. Ongoing
sustainment will increase head count between two and five full time
equivalents depending upon the type and number of infrastructures
(generation facilities, substations, control centers) containing
the Critical Cyber Assets (CCAs), the complexity of organizations
involved in managing these assets, the number of staff having cyber
and physical access to CCAs, and the Information Protection program
ingredients.
The recognition that the underlying Security Program requires significant resources to sustain the operations of the Security Program, especially in the first few years of operation, is one of the most significant miscalculations. For example, the effort alone to maintain configuration management through effective change control (including pre‐testing, post‐testing, approvals, documentation updates, etc.) for the requisite Critical Cyber Assets, Electronic Security Perimeters, and Physical Security Perimeters can be quite daunting. Personnel risk assessment issues, especially at the initiation of the program, can also present a challenge to existing staff. Ongoing sustainment staffing can ultimately be reduced through automation as described in the paragraphs that follow.
Collaborative
Environment
The CIP Reliability Standard requires that
organizations typically heretofore not operationally tightly
coupled, or that do not communicate frequently, to collaborative in
the design, implementation and sustainment of the compliant
Security Program. Specifically the corporate information technology
organization, or business IT, typically does not get involved in
operations of the transmission facilities through the EMS systems.
Likewise, the human resources department is not always linked in
with access control processes. Substation engineering and
transmission grid operations are also frequently located in
separate departments. Generation operations are almost always
separated. Finally, organizations addressing physical security and
cyber security organizations are typically separated.
These organizational characteristics presents challenges to the design, implementation and sustainment of a compliant Security Program as a significant portion of the underlying compliant Security Program processes require smooth hand‐offs across organizational boundaries. With this recognition, the governance issues raised earlier in this document become quite apparent. Ultimately, the need for a collaborative operations environment coupled with an appropriate governance structure is paramount in providing an environment for efficient and effective policies, processes, and plans to be sustained across the organization.
Effective
Program Structure
The provisions that
encompass the CIP Reliability Standard were designed by multiple
groups of engineers, information technology specialists, and
physical security managers. Unfortunately, the resultant standard
is not organized in a logical and functional manner. It has proven
to be very difficult and
ineffective to take a Standard‐by‐Standard,
Requirement‐by‐Requirement, approach to design, implement, and
manage the sustainment of the compliant Security Program. For
example, access control is addressed in four separate topical areas
of the standard. As a result, utilizing the structure of the
standard to govern the design, implementation, and sustainment
activities adds considerable risk and complexity to the sustainment
endeavor.
In order to simplify the design and implementation tasks and manage the sustainment of the compliant Security Program in an effective manner, a functional framework is considerably more appropriate. Functions define "what" needs to be done, processes (procedures) define "how" the work is to be performed, and "organizations" (the governance structure) defines who has responsibilities for performing the work. Functions are usually more static, processes typically cross functional boundaries, and organizations are frequently quite dynamic. Therefore, functional definitions provide a good "business model" foundation from which to assess, design, and implement new or enhanced programs.
Fifteen (15) different functional program areas10, as illustrated in Figure 2, have been designed as relevant to the design and implementation of a responsive and comprehensive NERC compliant Security Program. This functional program business model provides the framework for an organized and logical approach to continually identify gaps, and subsequently develop the required policies, procedures, documentation, and training and security awareness program ingredients. In addition, any identified technical requirements, including supporting software tools such as document control systems, identity management strategies, and network management tools, can be effectively developed in a controlled and consistent manner through this framework. Lastly, allocations of responsibilities within the organization can logically be assigned through a functional approach for sustainment of the Security Program.
It is important to note that visibility from a Standard‐by‐Standard / Requirement‐by‐Requirement dimension needs to be retained from an "audit" perspective. Auditors will need to understand and validate how each Standard and Requirement is being supported, e.g., what policy and process is being followed, and what evidentiary documentation is being generated to meet each of the underlying requirements of the standard. Accordingly, the ingredients of the compliant Security Program need to be mapped (using a defined matrix) to the Standard / Requirement that they support. In conclusion, a multi‐dimensional view is necessary to manage and provide ongoing audit support for the compliant Security Program.
Appropriate Process Definition
The CIP Reliability
Standard defines the need for "processes" throughout the
requirements delineated in the various sections of the standard.
Processes, in this context are frequently misunderstood as to what
needs to be included in a process. First, there are two types of
processes that need to be incorporated within a compliant Security
Program as follows:
-
Dynamic processes
-
Structured processes
Dynamic Processes are individually defined and assigned processes initiated to address an issue, gap, or commitment. Dynamic processes are typically uniquely defined "on the fly" and consist of one or two tasks assigned to specific individuals within the organization. The status of these processes are usually monitored until closed out or completed.
Structured Processes are pre‐designed, more complex, and processes typically consisting of multiple tasks and frequently involving multiple organizations. Structured processes need to be performed in a consistent manner and include appropriate internal control provisions.
Structured processes are:
-
Triggered by multiple Events, or the results of other Processes;
-
Assigned by Roles rather than individuals;
-
Contain a defined "Beginning" and an "End";
-
"Self‐documenting" e.g., generate evidentiary documentation each time they are performed
Define the "who, when, where, why, and how" the Policies (which defines the "what") will be implemented:
-
"Who" defines the roles and responsibilities;
-
"When" defines the timing of the Process (Event / Time Driven);
-
"Where" defines the location as applicable;
-
"Why" is to define the purpose of the process
-
"How" defines the sequence and description of the Activities and Tasks.
Both types of processes are required to sustain a compliant Security Program. Dynamic processes can best be supported through the use of automated commitment management or tracking systems appropriately integrated with existing internal E‐Mail and document management systems.
Structured processes are frequently designed to leverage existing procedures11 using "pointers" or "references" to guide users through multi‐task processes. This approach can leverage existing procedures and reduce the expenses associated with designing the required compliant Security Programs ingredients. However, as a result, these types of structured processes are frequently not "self‐contained12" and difficult to use while not retaining the key characteristics noted above. These characteristics impact the complexity of the Security Program and the ability to incorporate provisions for internal controls, ensure self‐documentation, identify appropriate process linkages, and incorporate appropriate performance measures. Recognizing that Critical Cyber Assets typically represent a small portion of the total number of cyber assets in the entity, at the risk and costs of duplicating processes11, it is highly recommended that, as a general rule, self contained processed be employed in the design of the compliant Security Program.
Consistency
Across Enterprise
It is recognized that
different types of infrastructures (generation plants, substations,
transmission grid control centers) and organizations (divisions)
operate differently. For example, generation facilities have
scheduled maintenance outages at which time a variety of changes
and upgrades can be made while transmission grid control centers
are 7 x 24 operations. In other situations, different labor laws
may impact the personnel risk assessment related processes in
different divisions of the enterprise. Accordingly, there is
usually a desire to customize the policies, processes, and
documentation for each type of infrastructure. While there are
certain processes that certainly need customization as such, the
complexity of the compliant Security Program can become
unmanageable if customization is allowed to flourish beyond reason.
This is especially true if provisions for automation are
contemplated in the future.
Therefore, in order to retain consistency and provide a means to effectively manage the complaint Security Program from a corporate perspective, customization should be minimized or employed in situations where the involvement of multiple organizations is minimal.
Automation
It is acknowledged that
the CIP Reliability Program is a complex and pervasive set of
requirements necessitating the design, implementation and
sustainment of a compliant Security Program that touches most
departments / organizations within an electric utility entity. In
addition, non‐compliance with the program has significant
ramifications that heretofore have not been a component of the risk
management agenda. Once a
compliant Security Program has been designed and successfully
implemented, maintaining an efficient and effective sustainment
becomes the objective of the day.
Automation of many of the processes, both dynamic and structured processes as discussed earlier, and evidentiary documentation tasks obviously increases the efficiencies and can result in hard dollar savings (personnel costs). A means to track gaps between changes in the NERC Reliability Standards and the compliant Security Program is also of great benefit to prevent future out of compliance conditions from occurring. However, many of the benefits for automation are not so obvious. For example, automation can:
-
Mitigate consequences of missing deadlines and commitments;
-
Minimize level of effort and costs to sustain Security Program
-
Ensure responsibilities for following through with process commitments, decisions, documents, and reports are completed;
-
Ensure that process triggers will be executed (scheduled events and results linked from other processes);
-
Improve timeliness and accuracy of compliance performance metrics and executive dashboards;
-
Alert staff to upcoming events and requirements;
-
Leverage information contained in existing systems;
-
Ensure processes, tasks, and results are appropriately documented;
-
Facilitate external audit readiness;
-
Provide a basis for identifying gaps;
-
Ensure compliance dates are maintained;
-
Generate audit trails; and
-
Ensure Security Program is consistently optimized and standardized across organizations and company.
The overall benefits for automating the basic functions of a compliant Security Program, provides a quick hard dollar payback amplified by the intangible benefits of ensuring compliance is maintained. There are a number of software solutions available in the marketplace, each with their advantages and disadvantages, which can be configured to provide an integrated solution while leveraging existing systems.
Adequate User
Training
User training is for those
individuals in the organization who will become involved in the
execution of the policies, processes and procedures and generate
relevant documentation. It is an essential component of the
compliance Security Program that is frequently overlooked. The
training includes review of the process responsibilities, defined
processes and procedures, triggers for the processes, and how to
generate the required evidentiary documentation and what to do with
the results. User training, in this context,
excludes the annual training
required by the standard for those staff that have access to
Critical Cyber Assets. The latter is a prepackaged training
curriculum that can be fulfilled in a short period of
time.
In summary, this component of the compliant Security Program is often overlooked and underestimated in terms of the time and resources required to continuously carry out the requirements of the program. Estimates for large organizations have ranged up to 1,600 man days of user time for the initial training and familiarization with the processes and documentation requirements (also called role‐based training). Subsequent training will be required as the program requirements evolve and issues, gaps, and commitments are addressed and fulfilled.
Managing
Compliance Performance
Self‐certification is a
milestone that signifies the completion of CIP Reliability Standard
compliant Security Program design, build, and implementation phases
and the start of the operations management. Only a portion of the
CIP Reliability Standard requires specified controls. Measures in
the CIP Reliability Standard in most cases refer to documentation
of the associated requirement and are not measures of a compliance
process or procedure performance. As a result, most registered
entities in a time and resource constrained environment have not
included focused design steps to ensure that controls, metrics, and
performance risk assessments are addressed for all procedures. Few
have had the luxury of analyzing newly developed CIP procedures for
performance variations (i.e., non‐compliance events). Additionally,
most independent assessments and audits of compliance readiness
target the identification of evidentiary documentation required for
CIP Standard requirements and measures, and not risk of
non‐compliance performance.
If the CMI SEI Capability Maturity Model13 concept is applied to CIP Reliability Standard compliant Security Programs at most registered entities, one could generalize that most have moved from the Initial (Ad Hoc) and Repeatable maturity levels to the Defined level. A key success factor in CIP compliance efficiency lies in the ability to imbed CIP compliance processes into existing operations. The more integrated the CIP processes are, the more effective the operations personnel will be in the execution of the additional requirements. With the procedures documented and personnel process knowledgeable as a result of implementation training, the procedures are then institutionalized as the "CIP" way of doing cyber and physical security. Progression to the Managed level requires process metrics and a management focus on reducing procedure variations.
It takes a combined top‐down and bottom‐up approach to identify and design an effective metrics program to support ongoing CIP compliance program management. Key performance indicators should be developed and linked to corporate objectives that include CIP Compliance objectives. Metrics should be identified and defined for supporting CIP processes using the SMART (specific, measureable, actionable, relevant, and timely) approach. Where existing systems have been modified to accommodate CIP requirements, new metrics reports will be necessary.
To learn more, visit www.dyonyx.com
El Paso Electric Company (EPE) is a medium sized integrated electric utility operating in the Rio Grande Valley in west Texas and southern New Mexico with over 347,000 customers. With approximately 1,000 employees, EPE has an installed net generation capacity of 1,500 MW including about 16% interests in the Palo Verde nuclear plant. While EPE is moderate in size when compared to many corporations, as an electric utility, the technology infrastructures are very complex incorporating real-time systems, large customer databases, multi-layered geographic information systems, communication and dispatch systems, and comprehensive facilities management systems all supporting the delivery of electricity to the customers. This environment, coupled with the key responsibility to operate and maintain the critical electric grid infrastructures for the region, sets the stage for the need of a robust set of information system infrastructures. These infrastructures, which include hardware architecture, application system architecture, network architecture, and supporting organizational resources, need to incorporate appropriate technology, integrated design concepts, and efficient processes to meet the needs of the corporation.
Executive Summary
EPE retained the services of DYONYX to perform a cooperative technical examination and assessment of various aspects of the current technical environment considered by leadership to be of high importance and worthy of focus. Our team concentrated on the methods and support of functional interfaces for core business applications, the design of EPE's LAN/WAN infrastructure, server strategies and storage architecture, desktop and software management, and secure communications.
Upon completion of this assessment DYONYX was awarded the contract to virtualize the EPE server environment to reduce the physical footprint required to provide current business functions while minimizing any disruptions to the current production environment. The intent of this consolidation was to reduce the physical footprint only and not change the logical infrastructure design as EPE was preparing to move their data center to a new location. DYONYX was asked to respond to an RFP on the data center relocation to determine if continuing to support their IT internally, hosting their services in a third party hosting facility or fully outsourcing all of IT was the most cost effective solution. Upon completion of the RFP, DYONYX was requested to respond to a full outsource RFP, which was also awarded to DYONYX.
End User Support
DYONYX provides technical personal computer, laptop and software assistance to all EPE personnel. Service Desk Level I personnel located in our centralized ITIL compliant Service Desk respond to employee questions and requests for service. Software support includes operating system, word processing, e-mail, presentation graphics, spreadsheet, drawing, illustration, file compression, workflow management, database, networking, web browser, and industry specific Commercial off the Shelf (COTS)

