[Federal Register: June 1, 1998 (Volume 63, Number 104)] [Rules and Regulations] [Page 29620-29643] From the Federal Register Online via GPO Access [wais.access.gpo.gov] [DOCID:fr01jn98-19] ----------------------------------------------------------------------- DEPARTMENT OF COMMERCE Patent and Trademark Office 37 CFR Part 1 [Docket No: 960828235-8109-02] RIN: 0651-AA88 Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Disclosures AGENCY: Patent and Trademark Office, Commerce. ACTION: Final rule. ----------------------------------------------------------------------- SUMMARY: The Patent and Trademark Office (PTO) is amending the rules for submitting nucleotide or amino acid sequences in computer readable form (CRF) for patent applications. These amendments simplify the requirements of the rules, rearrange portions of the rules for better understanding and establish consistent rules to permit a single internationally acceptable computer readable form. Sequence Listings will be presented in an international, language neutral format using numeric identifiers rather than the current subject headings. The Paper Sequence Listing will preferably be a separately numbered section of the patent application. Sequences which contain fewer than four specifically identified nucleotides or amino acids will no longer be required to be submitted in computer readable form. DATES: Effective date: July 1, 1998. The incorporation by reference of certain publications listed in the regulations is approved by the Director of the Federal Register as of July 1, 1998. Applicability date: Sections 1.821 through 1.825 as amended apply to applications filed on or after July 1, 1998, except for: (1) applications that claim the benefit of a prior application under 35 U.S.C. 120 filed before July 1, 1998, and which do not add subject matter involving a sequence listing subject to Secs. 1.821 through 1.825; and (2) reissue applications in which the application for the patent sought to be reissued was filed before July 1, 1998. Sections 1.821 through 1.825 apply during a reexamination proceeding if the application for the patent sought to be reexamined was filed on or after July 1, 1998. FOR FURTHER INFORMATION CONTACT: Esther M. Kepplinger, by telephone at (703) 308-1495; by mail addressed to: Box Comments--Patents, Assistant Commissioner for Patents, Washington, DC 20231 marked to her attention; by facsimile to (703) 305-3935; or by electronic mail at esther.kepplinger@uspto.gov. SUPPLEMENTARY INFORMATION: Sections 1.821 through 1.825 of title 37 provide a standardized format for the description of nucleotide and amino acid sequence data in patent applications and require the submission of such sequences in computer readable form (CRF). Sections 1.821 through 1.825 provide the following benefits to the PTO: (1) Improved search capabilities; (2) improved interference detection; (3) more efficient examination; (4) cost savings for the input of the sequence data; (5) more [[Page 29621]] efficient and accurate printing of sequences in patents; (6) exchange of the sequence data with other patent offices electronically; and (7) improved public access to the sequences electronically. Reasons for the Changes In response to the needs of our customers, the procedural requirements found in former Secs. 1.821 through 1.825 have been reduced. Sections 1.821 through 1.825 are being amended to be consistent with World Intellectual Property Organization (WIPO) Standard ST.25 (signed in 1998 and effective July 1, 1998). ST.25 replaces WIPO Standards ST.23 and ST.24 which deal with paper and electronic submissions of sequence listings. A Meeting of International Authorities (MIA) under the Patent Cooperation Treaty (PCT) was held in November of 1994 to discuss simplification of sequence listing submission requirements. Under the previous PCT Regulations, each International Searching Authority, each International Preliminary Examining Authority and each designated/ elected office was free to set the requirements for submission of sequence listings in paper and electronic form. This imposed a burden on applicants by requiring them to prepare sequence listings in many different formats. In addition, sequence listings were required to be translated for consideration in the national stage at considerable cost to applicants and at the risk that the information could be inaccurately translated. After the November 1994 MIA, the PTO, the European Patent Office (EPO) and the Japanese Patent Office (JPO) worked together with WIPO to create a new international standard which forms the basis of WIPO Standard ST.25 (1998). Sections 1.821 through 1.825 of 37 CFR, as amended herein, are consistent with WIPO Standard ST.25 (1998) and the PCT sequence listing requirements. Sequence listings prepared in accordance with Secs. 1.821 through 1.825 as amended generally will be acceptable in all countries which adhere to WIPO Standard ST.25 (1998). In addition, a sequence listing prepared in accordance with the Secs. 1.821 through 1.825 as amended will be acceptable for the national stage in all PCT member countries which require the submission of a sequence listing. As a result of this rule change, applicants will experience a reduction in cost since only one sequence listing in paper and electronic form will need to be prepared and translations of this listing will not be needed. All necessary changes to the text of Secs. 1.821 through 1.825 to reflect the new WIPO Standard ST.25 (1998), have been made. Each change is described below. Overview of the Changes The changes in this Final Rule include: (1) Use of numeric identifiers to replace the language subject headings within the submission; (2) Elimination of unnecessary and confusing data elements; (3) Movement of the paper Sequence Listing to the end of the application, preferably with separately numbered pages; (4) Elimination of the requirement to provide a submission for sequences with fewer than four specifically defined nucleotides or amino acids; (5) Use of lower-case one-letter codes for nucleotide bases; (6) Rearrangement of portions of the rules to improve their context; (7) Clarification and simplification of the rules to aid in understanding; and (8) Minor changes to accomplish harmonization with WIPO Standard ST.25 (1998) as well as the EPO and the JPO standards. Amended Secs. 1.821 through 1.825 are not mandatory for: (1) applications that claim the benefit of a prior application under 35 U.S.C. 120 filed before July 1, 1998, and which do not add subject matter involving a sequence listing subject to Secs. 1.821 through 1.825; (2) reissue applications in which the application for the patent sought to be reissued was filed before July 1, 1998; and (3) reexamination proceedings if the application for the patent sought to be reexamined was filed before July 1, 1998. The PTO will accept and encourages the submission of sequence listings in compliance with amended Secs. 1.821 through 1.825 for any application or reexamination proceeding. All sequence listings (including the entire computer readable form) must be submitted in compliance with either Secs. 1.821 through 1.825 as amended in this Final Rule or (when permitted) former Secs. 1.821 through 1.825. If the CRF for a new application would be identical to a compliant CRF already on file in the PTO, the applicant may make reference to the other application and the CRF in lieu of filing a duplicate CRF in the new application by following the procedures set forth in Sec. 1.821(e). If exceptional circumstances do arise and certain applicants experience specific hardships in attempting to comply with amended Secs. 1.821 through 1.825, the PTO will consider a petition under Sec. 1.183 to waive certain requirements of Secs. 1.821 through 1.825. A Notice of Proposed Rulemaking entitled ``Changes Implementing Nucleotide and/or Amino Acid Sequence Listings'' (Notice of Proposed Rulemaking) was published in the Federal Register at 61 FR 51855 (October 4, 1996), and in the Official Gazette of the Patent and Trademark Office, at 1191 Off. Gaz. Pat. Office 168 (October 29, 1996). Sections 1.821 through 1.825 as adopted contain several changes from these sections. This Final Rule provides a discussion of the content of the specific rules being amended, description of the changes in the text of the proposed rules, and explanation of the reasons supporting the changes. In addition, comments received in response to the Notice of Proposed Rulemaking are analyzed. Discussion of Specific Rules and Changes from the Proposed Rules: Title 37 of the Code of Federal Regulations, Part 1, is amended as follows. Section 1.77 The proposed change to 37 CFR 1.77 was previously adopted. See Miscellaneous Changes to Patent Practice; Final Rule, 61 FR 42790 (August 19, 1996), 1190 Off. Gaz. Pat. Office 67 (September 17, 1996). Section 1.821 Section 1.821 incorporates by reference the World Intellectual Property Organization (WIPO) Handbook on Industrial Property Information and Documentation, Standard ST.25 (1998), including Tables 1 through 6 of Appendix 2, in accordance with 5 U.S.C. 552(a) and 1 CFR part 51. Copies may be obtained from the World Intellectual Property Organization; 34 chemin des Colombettes; 1211 Geneva 20 Switzerland. Copies may be inspected at the Patent Search Room; Crystal Plaza 3, Lobby Level; 2021 South Clark Place; Arlington, VA 22202. Copies may also be inspected at the Office of the Federal Register, 800 North Capitol Street, NW, Suite 700, Washington, DC 20408. These Tables are reproduced below. WIPO Standard ST.25 (1998), Appendix 2, Table 1, provides that the bases of a nucleotide sequence should be represented using the following one-letter code for nucleotide sequence characters: [[Page 29622]] Table 1.--One Letter Codes for Nucleotide Sequences ---------------------------------------------------------------------------------------------------------------- Symbol Meaning Origin of designation ---------------------------------------------------------------------------------------------------------------- a................................... a...................... adenine. g................................... g...................... guanine. c................................... c...................... cytosine. t................................... t...................... thymine. u................................... u...................... uracil. r................................... g or a................. purine. y................................... t/u or c............... pyrimidine. m................................... a or c................. amino. k................................... g or t/u............... keto. s................................... g or c................. strong interactions 3 H-bonds. w................................... a or t/u............... weak interactions 2 H-bonds. b................................... g or c or t/u.......... not a. d................................... a or g or t/u.......... not c. h................................... a or c or t/u.......... not g. v................................... a or g or c............ not t, not u. n................................... (a or g or c or t/u) or any (unknown or other). ---------------------------------------------------------------------------------------------------------------- WIPO Standard ST.25 (1998), Appendix 2, Table 2, provides that modified bases may be represented as the corresponding unmodified bases in the sequence itself, if the modified base is one of those listed below and the modification is further described in the Feature section of the Sequence Listing. The codes from the list below may be used in the description (i.e., the specification and drawings, or in the Sequence Listing) but these codes may not be used in the sequence itself. Table 2.--Modified Bases ------------------------------------------------------------------------ Symbol Meaning ------------------------------------------------------------------------ ac4c......................... 4-acetylcytidine. chm5u........................ 5-(carboxyhydroxylmethyl)uridine. cm........................... 2-O-methylcytidine. cmnm5s2u..................... 5-carboxymethylaminomethyl-2-thiouridine. cmnm5u....................... 5-carboxymethylaminomethyluridine. d............................ dihydrouridine. fm........................... 2-O-methylpseudouridine. gal q........................ beta, D-galactosylqueuosine. gm........................... 2-O-methylguanosine. I............................ inosine. i6a.......................... N6-isopentenyladenosine. m1a.......................... 1-methyladenosine. m1f.......................... 1-methylpseudouridine. m1g.......................... 1-methylguanosine. m1i.......................... 1-methylinosine. m22g......................... 2,2-dimethylguanosine. m2a.......................... 2-methyladenosine. m2g.......................... 2-methylguanosine. m3c.......................... 3-methylcytidine. m5c.......................... 5-methylcytidine. m6a.......................... N6-methyladenosine. m7g.......................... 7-methylguanosine. mam5u........................ 5-methylaminomethyluridine. mam5s2u...................... 5-methoxyaminomethyl-2-thiouridine. man q........................ beta, D-mannosylqueuosine. mcm5s2u...................... 5-methoxycarbonylmethyl-2-thiouridine. mcm5u........................ 5-methoxycarbonylmethyluridine. mo5u......................... 5-methoxyuridine. ms2i6a....................... 2-methylthio-N6-isopentenyladenosine. ms2t6a....................... N-((9-beta-D-ribofuranosyl-2- methylthiopurine-6-yl) carbamoyl) threonine. mt6a......................... N-((9-beta-D-ribofuranosylpurine-6-yl)N- methylcarbamoyl) threonine. mv........................... uridine-5-oxyacetic acid-methylester. o5u.......................... uridine-5-oxyacetic acid. osyw......................... wybutoxosine. p............................ pseudouridine. q............................ queuosine. s2c.......................... 2-thiocytidine. s2t.......................... 5-methyl-2-thiouridine. s2u.......................... 2-thiouridine. s4u.......................... 4-thiouridine. t............................ 5-methyluridine. t6a.......................... N-((9-beta-D-ribofuranosylpurine-6-yl)- carbamoyl)threonine. tm........................... 2-O-methyl-5-methyluridine. um........................... 2-O-methyluridine. [[Page 29623]] yw........................... wybutosine. x............................ 3-(3-amino-3-carboxy-propyl)uridine, (acp3)u. ------------------------------------------------------------------------ WIPO Standard ST.25 (1998), Appendix 2, Table 3, provides that the amino acids should be represented using the following three-letter code with the first letter as a capital. Table 3.--Amino Acid Three-Letter Codes ------------------------------------------------------------------------ Symbol Meaning ------------------------------------------------------------------------ Ala....................................... Alanine. Cys....................................... Cysteine. Asp....................................... Aspartic Acid. Glu....................................... Glutamic Acid. Phe....................................... Phenylalanine. Gly....................................... Glycine. His....................................... Histidine. Ile....................................... Isoleucine. Lys....................................... Lysine. Leu....................................... Leucine. Met....................................... Methionine. Asn....................................... Asparagine. Pro....................................... Proline. Gln....................................... Glutamine. Arg....................................... Arginine. Ser....................................... Serine. Thr....................................... Threonine. Val....................................... Valine. Trp....................................... Tryptophan. Tyr....................................... Tyrosine. Asx....................................... Asp or Asn. Glx....................................... Glu or Gln. Xaa....................................... Unknown or other. ------------------------------------------------------------------------ WIPO Standard ST.25 (1998), Appendix 2, Table 4, provides that modified and unusual amino acids may be represented as the corresponding unmodified amino acids in the sequence itself if the modified or unusual amino acid is one of those listed below and the modification is further described in the Feature section of the Sequence Listing. The codes from the list below may be used in the description (i.e., the specification and drawings, or in the Sequence Listing) but these codes may not be used in the sequence itself. Table 4.--Modified and Unusual Amino Acid Codes ---------------------------------------------------------------------------------------------------------------- Symbol Meaning ---------------------------------------------------------------------------------------------------------------- Aad........................................ 2-Aminoadipic acid. bAad....................................... 3-aminoadipic acid. bAla....................................... beta-Alanine, beta-Aminopropionic acid. Abu........................................ 2-Aminobutyric acid. 4Abu....................................... 4-Aminobutyric acid, piperidinic acid. Acp........................................ 6-Aminocaproic acid. Ahe........................................ 2-Aminoheptanoic acid. Aib........................................ 2-Aminoisobutyric acid. bAib....................................... 3-Aminoisobutyric acid. Apm........................................ 2-Aminopimelic acid. Dbu........................................ 2,4-Diaminobutyric acid. Des........................................ Desmosine. Dpm........................................ 2,2-Diaminopimelic acid. Dpr........................................ 2,3-Diaminopropionic acid. EtGly...................................... N-Ethylglycine. EtAsn...................................... N-Ethylasparagine. Hyl........................................ Hydroxylysine. aHyl....................................... allo-Hydroxylysine. 3Hyp....................................... 3-Hydroxyproline. 4Hyp....................................... 4-Hydroxyproline. Ide........................................ Isodesmosine. aIle....................................... allo-Isoleucine. MeGly...................................... N-Methylglycine, sarcosine. MeIle...................................... N-Methylisoleucine. MeLys...................................... 6-N-Methyllysine. MeVal...................................... N-Methylvaline. Nva........................................ Norvaline. Nle........................................ Norleucine. Orn........................................ Ornithine. ---------------------------------------------------------------------------------------------------------------- WIPO Standard ST.25 (1998), Appendix 2, Table 5 provides for feature keys related to DNA sequences. BILLING CODE 3510-16-P [[Page 29624]] [GRAPHIC] [TIFF OMITTED] TR01JN98.000 [[Page 29625]] [GRAPHIC] [TIFF OMITTED] TR01JN98.001 [[Page 29626]] [GRAPHIC] [TIFF OMITTED] TR01JN98.002 [[Page 29627]] [GRAPHIC] [TIFF OMITTED] TR01JN98.003 [[Page 29628]] [GRAPHIC] [TIFF OMITTED] TR01JN98.004 [[Page 29629]] [GRAPHIC] [TIFF OMITTED] TR01JN98.005 BILLING CODE 3510-16-C Further in paragraph (a) of Sec. 1.821, both occurrences of ``Copies of ST.23'' have been changed to ``Copies of WIPO Standard ST.25 (1998).'' This change is necessary to reflect the new standard number. In paragraph (a)(1) of Sec. 1.821, ``ST.23 (April 1994), paragraph 8'' has been changed to ``ST.25 (1998), Appendix 2, Table 1.'' This change reflects the correct information with regard to the incorporated WIPO standard and the list of symbols to be used for nucleotide sequence characters. Further in paragraph (a)(1) of Sec. 1.821, ``ST.23 (April 1994), paragraph 9'' has been changed to ``ST.25 (1998), Appendix 2, Table 2.'' This change reflects the correct information with regard to the incorporated WIPO standard and the list of modified bases which can be presented as unmodified nucleotide sequence characters. In paragraph (a)(2) of Sec. 1.821, all three occurrences of ``ST.23 (April 1994), paragraph 11'' have been changed to ``ST.25 (1998), Appendix 2, Table 3.'' This change reflects the correct information with regard to the incorporated WIPO standard and the list of symbols to be used for amino acid sequence characters. Further in paragraph (a)(2) of Sec. 1.821, ``ST.23 (April 1994), paragraph 12'' has been changed to ``ST.25 (1998), Appendix 2, Table 4.'' This change reflects the correct information with regard to the incorporated WIPO standard and the list of modified or unusual amino acids which can be presented as unmodified amino acid sequence characters. In paragraph (c) of Sec. 1.821, each of the three occurrences of the words ``integer identifier'' or ``integer identifiers'' has been changed to ``sequence identifier'' or ``sequence identifiers'' as appropriate. WIPO Standard ST.25 (1998), uses the term ``sequence identifier'' rather than ``integer identifier.'' Thus, this change is necessary to achieve harmonization with the international standard. In the last sentence of paragraph (c) of Sec. 1.821, the phrase ``The sequence omitted shall appear following the integer identifier'' of the proposed rule has been replaced by ``the code `000' shall be used in place of the sequence.'' The response for the numeric identifier <160> shall include the total number of SEQ ID NOs, whether followed by a sequence or by the code ``000''. The code ``000'' should be put into <400>. This change permits flexibility in the preparation and amendment of Sequence Listings. It also makes the rule language- neutral and is consistent with WIPO Standard ST.25 (1998). In paragraph (d) of Sec. 1.821, the words ``integer identifier'' have been changed to ``sequence identifier.'' WIPO Standard ST.25 (1998) uses the term ``sequence identifier'' rather than ``integer identifier.'' Thus, this change is necessary to achieve harmonization with the international standard. In paragraphs (f), (g) and (h) of Sec. 1.821, the sentence ``Such a statement must be a verified statement if made by a person not registered to practice before the Office'' has been deleted. The separate verification requirements in Sec. 1.821 have been eliminated in view of the recent amendment to Secs. 1.4(d) and 10.18. See Changes to Patent Practice and Procedure; Final Rule, 62 FR. 53131 (October 10, 1997), 1203 Off. Gaz. Pat. Office 63 (October 21, 1997). Paragraph (g) of Sec. 1.821 has also been amended to provide that the Office will provide a ``period of time'' (rather than one month) within which the applicant must comply with the requirements of Sec. 1.821(b) through (f) in order to avoid abandonment. Further in paragraph (f) of Sec. 1.821, the following has been added at the end of the first sentence, '', e.g., the information recorded in computer readable form is identical to the written sequence listing.'' WIPO Standard ST.25 (1998), paragraph 39, requires the language which has been added as an acceptable example for phrasing the required statement that the computer readable form and the written sequence listing are the same. Section 1.822 In paragraph (b) of Sec. 1.822, both references to WIPO Standard ST.23 (April 1994), paragraphs 8 and 11, as proposed have been changed to ``WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3.'' These changes reflect the correct information with regard to the incorporated WIPO standard and the lists of symbols for nucleotide and amino acid sequence characters. Further in paragraph (b) of Sec. 1.822, ``WIPO Standard ST.23 (April 1994), paragraphs 9 and 12'' as proposed has been changed to ``WIPO Standard ST.25 (1998), Appendix 2, Tables 2 and 4.'' [[Page 29630]] This change reflects the correct information with regard to the incorporated WIPO standard and the lists of modified bases and modified or unusual amino acids which can be depicted in the Sequence Listing via the symbols for a corresponding unmodified base or amino acid. Further in paragraph (b) of Sec. 1.822, the symbol designating an unknown nucleotide base or a nucleotide base other than those listed in the WIPO standard was proposed as an upper case letter ``N.'' This symbol has been changed to a lower case letter ``n.'' This change is consistent with the use of lower case letters for the symbols representing the nucleotide bases. Further in paragraph (b) of Sec. 1.822, the language has been clarified to specifically state that each ``n'' or ``Xaa'' represents only a single residue. Thus, for example, a single ``Xaa'' may not be used to designate a string of four amino acids, each of which is unknown. This represents a codification of existing practice. Further in paragraph (b) of Sec. 1.822, the information required in the Feature section to explain the use of ``n'' or ``Xaa'' in a given sequence is referred to ``as appropriate.'' Additional instruction is added at the end of paragraph (b) of Sec. 1.822 following ``the Feature section'' indicating'', preferably by including one or more feature keys listed in WIPO Standard ST.25 (1998), Appendix 2, Tables 5 and 6.'' This change specifies the preference for using the feature keys listed in the WIPO standard in order to aid applicants in filing a CRF which will comply with WIPO Standard ST.25 (1998). These feature keys are controlled vocabulary and are considered language neutral. Their use is required in a PCT patent application or a patent application in a foreign country which has adopted WIPO Standard ST.25 (1998). In paragraph (c)(1) of Sec. 1.822, ``WIPO Standard ST.23 (April 1994), paragraph 8'' as proposed has been changed to ``WIPO Standard ST.25 (1998), Appendix 2, Table 1.'' This change reflects the correct information with regard to the incorporated WIPO standard and the list of symbols to be used for nucleotide sequence characters. In paragraph (d)(1) of Sec. 1.822, ``WIPO Standard ST.23 (April 1994), paragraph 11, as proposed has been changed to ``WIPO Standard ST.25 (1998), Appendix 2, Table 3.'' This change reflects the correct information with regard to the incorporated WIPO standard and the list of symbols to be used for amino acid sequence characters. In paragraph (d)(4) of Sec. 1.822, the section notes that enumeration requirements are applicable to amino acid sequences that are circular in configuration. The following language has been added to the end of the paragraph '', with the exception that the designation of the first amino acid of the sequence may be made at the option of the applicant.'' This change is necessary to provide consistency with its counterpart of circular nucleotide sequences as provided in paragraph (c)(7) of Sec. 1.822. This change is also consistent with WIPO Standard ST.25 (1998), paragraph 21. In paragraph (e) of Sec. 1.822, the words ``integer identifiers'' have been changed to ``sequence identifiers .'' WIPO Standard ST.25 (1998) uses the term ``sequence identifier'' rather than ``integer identifier.'' Thus, this change is necessary to achieve harmonization with the international standard. Section 1.823 In paragraph (a) of Sec. 1.823, the entire second sentence which read ``On a separate page of the application specification, immediately prior to the claims, there shall be a reference to the presence of the `Sequence Listing' in a `Sequence Listing Annex.''' has been eliminated. The designation of the Sequence Listing as an annex to the specification was initially proposed in an early version of the international standard. This terminology is not used in WIPO Standard ST.25 (1998), however, and so it has also been eliminated from paragraph (a) of Sec. 1.823, as proposed. Simplification results as well by the elimination of the requirement that the Sequence Listing must be designated as an annex to the specification. In paragraph (a) of Sec. 1.823, the third sentence has been modified by deleting the words ``shall appear in the `Sequence Listing Annex,' which is.'' As explained above, the current version of the international standard does not require designating the Sequence Listing as an annex to the specification. In paragraph (a) of Sec. 1.823, the words ``preferably should be'' have been added to the third sentence, before ``numbered independently of the numbering of the remainder of the application'' to describe the independent page numbering of the Sequence Listing in paper copy form. The term ``preferably'' was added for purposes of harmonization with WIPO Standard ST.25 (1998). In paragraph (a) of Sec. 1.823, the last clause of the third sentence ``and shall be placed in the application file'' has been deleted as unnecessary and potentially confusing now that the reference to a ``Sequence Listing Annex'' has been removed from this paragraph. In paragraph (a) of Sec. 1.823, the fourth sentence has been eliminated in its entirety. As explained above, the current version of the international standard does not require designating the Sequence Listing as an annex to the specification. In paragraph (a) of Sec. 1.823, in both occurrences in the fifth sentence and in the single occurrence in the sixth sentence, the word ``shall'' has been changed to ``should.'' These changes are necessary for purposes of achieving consistency with WIPO Standard ST.25 (1998). In paragraph (b) of Sec. 1.823, the first sentence has been modified by the deletion of the words ``in addition to and immediately preceding.'' This change is consistent with WIPO Standard ST.25 (1998). In paragraph (b) of Sec. 1.823, the fifth sentence has been deleted, eliminating the prohibition of any item of information occupying more than one line. This change is consistent with WIPO Standard ST.25 (1998). In paragraph (b) of Sec. 1.823, the last sentence has been deleted to eliminate the ``rep'' designation for data elements of the ``Sequence Listing.'' Certain data elements may still be repeated within the listing but this change was made for harmonization of the table with WIPO Standard ST.25 (1998). In paragraph (b) of Sec. 1.823, the eighth sentence has been modified to reflect the new numeric numbering scheme, for harmonization with WIPO Standard ST.25 (1998). Specifically, ``<100> through <193>'' of the proposed rule has been changed to ``<110> through <170>.'' The table in paragraph (b) of Sec. 1.823, has been changed to reflect the revised numbering scheme and data elements used in WIPO Standard ST.25 (1998). The specific changes are as follows: Numeric identifier ``<100>, General Information,'' has been deleted from the proposed rules, as it is not present in WIPO Standard ST.25 (1998). Numeric identifier ``<110>, Applicant,'' in the proposed rule, has been changed to indicate that ``preferably '' a maximum of ten names may be indicated. This change allows for more than ten names in the Applicant field for those instances in which such would be appropriate. This change is consistent with WIPO Standard ST.25 (1998). Numeric identifier ``<120>, Title of Invention,'' in the proposed rule, has been changed to eliminate the limitation [[Page 29631]] that the title be a maximum of four lines. This change allows applicants more flexibility with respect to the title. This change is consistent with WIPO Standard ST.25 (1998). Numeric identifier ``<130>, Number of Sequences,'' in the proposed rule, has been changed to reflect ``<130>, File Reference,'' as stated in WIPO Standard ST.25 (1998). This numeric identifier was indicated as ``<183>, File Reference/Docket Number '', in the rule as proposed. As proposed this was an optional numeric identifier. The numeric identifier remains optional once the application has been assigned an application number, e.g., a serial number. This numeric identifier is now MANDATORY when an application number has not yet been assigned to the application, such as on the day the application is initially filed. This change will assist in the matching of sequence information submissions with an application in the event that either the paper copy or the computer readable form were to become separated from the remainder of the application. This change is consistent with WIPO Standard ST.25 (1998). The Number of Sequences field identified as ``<130>'' in the proposed rule is now numbered ``<160>'' in Sec. 1.823 as adopted and redefined as ``Number of SEQ ID NOs.'' The information associated with numeric identifiers ``<140>'' through ``<153>,'' ``Correspondence Address'' through ``Operating System'' of the proposed rule, has been eliminated to reduce the burden on the applicant and to harmonize with WIPO Standard ST.25 (1998). Some of these numeric identifiers have been used in the new numbering scheme and have been associated with different information as indicated herein and in the Table of Sec. 1.823. One remaining numeric identifier within the Computer Readable Form section, ``<154>, Software,'' of the proposed rule, will remain, with the exception that it has been reassigned the numeric identifier of ``<170>'' to reflect the numbering scheme presented in WIPO Standard ST.25 (1998). The main headings ``<160>, Current Application Data'' and ``<170>, Prior Application Data,'' of the proposed rules, have been eliminated to harmonize with WIPO Standard ST.25 (1998) and reduce the number of fields in the Sequence Listing. The information that was to appear under these main headings remains in the rules but has been reassigned numeric identifiers <140> through <151>. The specific changes are as follows: ``<160>'' has been redefined as ``Number of SEQ ID NOs ''; ``<161>, Application Number,'' of the proposed rule is now numbered as ``<140>,'' and is defined as ``Current Application Number'; ``<162>, Filing Date,'' of the proposed rule is now numbered ``<141>,'' and is defined as ``Current Filing Date''; ``<170>'' has been redefined as ``Software ``; ``<171>, Application Number,'' of the proposed rule is now numbered as ``<150>,'' and is defined as ``Prior Application Number''; ``<172>, Filing Date,'' of the proposed rule is now numbered as ``<151>,'' and is defined as ``Prior Application Filing Date.'' The numeric identifiers now numbered ``<150>, Prior Application Number,'', and ``<151>, Prior Application Filing Date,'' are now mandatory only in those instances in which a claim for priority with respect to those prior applications is being made under either 35 U.S.C. 119 or 120. This change will provide information in this regard when it is most useful and was necessary to harmonize these rules with WIPO Standard ST.25 (1998). Throughout the Sequence Listing, application numbers must be set forth as a combination of the two digit country code, as set forth in WIPO Standard ST.3, as well as an application number in accordance with WIPO Standard ST.13 or for an international application, the numbering system as set out in Section 307(a) of the Administrative Instructions under the PCT. Numeric identifiers ``<180>, Attorney/Agent Information,'' through ``<182>, Registration Number,'' of the proposed rule, have been eliminated to harmonize with WIPO Standard ST.25 (1998) and reduce the number of fields in the Sequence Listing. Numeric identifier ``<183>, File Reference/Docket Number'' of the proposed rule has been reassigned as numeric identifier ``<130>,'' and redefined as ``File Reference'' in an effort to harmonize with WIPO Standard ST.25 (1998). The Telecommunication Information section, ``<190>'' through ``<193>'' of the proposed rules, has been eliminated in order to reduce the number of fields in the Sequence Listing and harmonize with WIPO Standard ST.25 (1998). Numeric identifier ``<200>, Information for SEQ ID NO:#:'', has been reassigned the numeric identifier ``<210>, SEQ ID NO: #:'' This numeric identifier indicates the integer, referred to in these final rules as the sequence identifier for both the sequence information and the actual sequence which follows the information. Numeric identifier ``<210>, Sequence Characteristics,'' of the proposed rule has been eliminated in order to reduce the number of required elements in the Sequence Listing and harmonize with WIPO Standard ST.25 (1998). The valid responses for the mandatory numeric identifier ``<212>, Type,'' have been changed from ``N'' and ``A'', as stated in the proposed rule, to ``DNA,'' ``RNA,'' and ``PRT'' (protein) in order to harmonize with WIPO Standard ST.25 (1998). A compound that is a mixture of DNA and RNA should be represented by ``DNA.'' This change is consistent with WIPO Standard ST.25 (1998). Numeric identifier ``<213>, Organism,'' has been added to the Sequence Listing of these final rules in an effort to harmonize with WIPO Standard ST.25 (1998). A response for the Organism identifier is MANDATORY. The valid responses are the scientific name, i.e. ``Genus species'', ``Artificial Sequence'', or ``Unknown.'' Numeric identifier ``<214>, Topology,'' of the proposed rule, has been eliminated to harmonize with WIPO Standard ST.25 (1998), and to reduce the burden on the applicant. Numeric identifier ``<290>, Feature,'' has become numeric identifier ``<220>, Feature.'' This numeric identifier has become MANDATORY for those sequences in which numeric identifier ``<213>, Organism,'' is completed with either ``Artificial Sequence'' or ``Unknown.'' This numeric identifier is also required if the compound sequence is a mixture of DNA and RNA. Numeric identifier ``<220>, Feature'' is a header only. No data are added immediately following this numeric identifier. These changes are required to achieve harmonization with WIPO Standard ST.25 (1998). Numeric identifier ``<291>, Name/Key,'' has become numeric identifier ``<221>, Name/Key.'' As proposed, the information provided was restricted to a maximum of four lines. The four line restriction has been removed to reduce the limitations on this field. The comment section of this numeric identifier has been changed in that it now indicates that the selection of a feature name or feature key is preferably made from those listed in Tables 5 and 6 of WIPO Standard ST.25 (1998). These tables are reproduced above and this preference for the listed feature names and keys is consistent with the requirement of WIPO Standard ST.25 (1998). Numeric identifier ``<292>, Location,'' has become ``<222>, Location,'' so as to be consistent with the numeric identifiers contained in WIPO Standard ST.25 (1998). [[Page 29632]] Numeric identifier ``<294>, Other Information,'' has become numeric identifier ``<223>, Other Information,'' so as to be consistent with the numeric identifiers contained in WIPO Standard ST.25 (1998). This numeric identifier has become MANDATORY for those sequences in which numeric identifier ``<213>, Organism,'' is completed with either ``Artificial Sequence'' or ``Unknown''. Numeric identifier ``<223>, Other Information,'' should contain source information in those instances when the organism is unknown or is an artificial sequence. For example, the source may be unknown because the material was isolated from a mixed bacterial culture rather than a pure culture. In such a case, numeric identifier ``<223>, Other Information,'' should be completed by explaining the mixed culture source of the sequenced material. If a sequence is completely synthesized this should be indicated in numeric identifier ``<223>, Other Information,'' while numeric identifier ``<213>, Organism,'' would indicate ``Artificial Sequence.'' This change has been made to accomplish harmonization between these rules and WIPO Standard ST.25 (1998) which contains the same mandatory requirement in this regard. Numeric identifiers ``<308>'' through ``<310>,'' referring to the `` Patent Document Number,'' ``Filing Date'' and `` Publication Date,'' of the proposed rule, have been moved to numeric identifiers ``<310>'' to ``<312>,'' respectively, of this Final Rule in order to harmonize with the numeric numbering scheme of WIPO Standard ST.25 (1998). Citations in the Sequence Listing must comply with WIPO Standard ST.6 for publication numbers and WIPO Standard ST.16 for document codes. New numeric identifiers ``<308>, Database Accession Number,'' and ``<309> Database Entry Date,'' have been added to the final rules to harmonize with WIPO Standard ST.25 (1998). These fields were added to the publication information section of WIPO Standard ST.25 (1998) to give an applicant more opportunity to further identify a published citation. Numeric identifier <400> ``Sequence Description: SEQ ID NO:#:'' has been changed to ``Sequence `` for clarity. Also for clarity, the explanation in the table has been changed to ``SEQ ID NO shall follow the numeric identifier and should appear on the line preceding the sequence.'' The format of the date fields has been changed throughout the Sequence Listing to accommodate for international conventions. All date fields referenced in the Sequence Listing shall conform to WIPO Standard ST.2. Because compliance with Secs. 1.821 through 1.825 as amended should produce Sequence Listings that are acceptable to all receiving offices, a standardized date field convention was required. Section 1.824 In paragraph (a)(6) of Sec. 1.824, ``, the date on which the data were recorded on the computer readable form'' was added after ``title of the invention'' to harmonize with WIPO Standard ST.25 (1998) requirements. While this requirement of Sec. 1.824 was proposed to be eliminated, that proposal is not adopted for purposes of harmonization with WIPO Standard ST.25 (1998). Also in paragraph (a)(6) of Sec. 1.824, `` name and type of computer and'' was deleted to reduce the requirements. Section 1.825 In paragraphs (a), (b), and (d) of Sec. 1.825, the sentence ``Such a statement must be a verified statement if made by a person not registered to practice before the Office'' has been deleted. The separate verification requirements in Sec. 1.825 have been eliminated in view of the recent amendment to Secs. 1.4(d) and 10.18. See Changes to Patent Practice and Procedure; Final Rule, 62 FR. 53131 (October 10, 1997), 1203 Off. Gaz. Pat. Office 63 (October 21, 1997). Response to and Analysis of Comments Six written comments were received in response to the Notice of Proposed Rulemaking. Several of these comments address the three specific queries set forth in the Notice of Proposed Rulemaking. The first query posed in the Notice of Proposed Rulemaking was: (1) Should the PTO accept voluntary submissions of computer readable forms and Sequence Listings where a D-amino acid is contained in the sequence? If such voluntary submissions are accepted, should there be a restriction on the choice of identifying a D-amino acid by an Xaa or by its L-amino acid counterpart abbreviation? Comment: One comment indicated that not only should the PTO accept voluntary submissions under these rules where a D-amino acid is contained in the sequence, the Office should make such submissions mandatory and designated by an Xaa. One comment indicated that sequences containing D-amino acids should not be in the PTO databases. Response: Upon careful consideration, the PTO has decided to accept voluntary submissions of protein sequences containing D-amino acids. The PTO strongly encourages anyone making such voluntary submissions to identify a D-amino acid with an Xaa, describing the D-amino acid in the Features section of the Sequence Listing. This section is indicated by numeric identifiers <220> through <223> in 37 CFR 1.823. Procedural concerns compel this acceptance of voluntary submissions. Computer readable forms are processed prior to examination. It is cumbersome to establish a viable procedure to redact any voluntary submissions out of the PTO database. The use of Xaa to indicate a D-amino acid, should such sequence information be submitted in accordance with these rules, is encouraged so as to alert anyone reviewing the sequence that a particular amino acid is other than a naturally occurring L-amino acid and to more accurately depict the extent of similarities between such a sequence and the L-amino acid containing sequences present in a database being searched for examination or other purposes. Because the sequence databases do not currently include D-amino acids in sequences and thus are not searchable for such sequences, the submission of those sequences containing D-amino acids will not be made mandatory. The second query posed in the proposed rules was: (2) Should the provisions of 37 CFR 1.821(c) be altered to exclude some prior art sequences from inclusion in the Sequence Listing even though they are presented in a patent application disclosure as sequences? Should the reference to an accession number of an admitted prior art sequence in a publicly available, electronic, sequence database suffice and exclude that sequence from the requirements of the sequence rules? Comment: Four comments indicated that known ``prior art'' sequences should not be required in the Sequence Listing. A referral to a publicly available, electronic, sequence database for access to such ``prior art'' sequences would be an acceptable alternative to two of those commenting on this aspect; the other two did not address this point. The reasons given for excluding such sequences are the expense and time required by applicants and their representatives in the inclusion of ``prior art'' sequences that are considered to be ``non- inventive''. Reducing the bulk of the paper copy of the Sequence Listing was also mentioned. Response: The requirement to submit all disclosed sequences in the format required by Secs. 1.821 through 1.825 is [[Page 29633]] maintained. This point was discussed with officials from the JPO and EPO. The offices have considered the stated concerns with regard to costs to applicants. Sections 1.821 through 1.825 do not require any information to be disclosed in the form of a sequence, but rather require a particular format whenever information is presented in the form of a sequence. Those applicants for whom compliance with the rules remains a significant hardship may petition under Sec. 1.183 for a waiver of the applicable requirement of Secs. 1.821 through 1.825. The technical and legal concerns mentioned in the Notice of Proposed Rulemaking still exist concerning the use of an alternative reference to a publicly available, electronic, sequence database. These concerns are: (1) What constitutes a publicly available, electronic, sequence database? (2) Would the USPTO and the other patent offices which have similar rules be required to produce a list of internationally accepted databases? (3) What would be the criteria for such acceptance? (4) An additional issue would exist involving electronic records maintenance: is there any assurance that once information is contained in a database that it will be retained and available indefinitely without alteration? Changes to the information in nucleic acid sequence databases resulting from the discovery of sequencing errors are well-known. (5) Does the mere existence of the sequence information in such a record constitute reasonable means of retrieval? In other words, would one need some text basis or other identifier to retrieve the information? Additional reasons for the inclusion of these prior art sequences remain relevant. These reasons are: (1) the assessment of whether a particular sequence falls within the requirements of the current rules is simple; (2) the general public is assured that all patents which contain any sequence information contain all of the sequence information in the Sequence Listing and all sequences are available in a computer accessible form; and (3) as a publication, the contextual association of new and old information is potentially unique to the patent and very valuable to anyone assessing the state of the art at the time of a patented invention, and thus are desirable to be present in electronic form in association with that patent. The third query posed in the proposed rules was: (3) Should Sequence Listings filed in an international application filed under the PCT be published only electronically and made available for retrieval electronically by an accession number from several sequence repositories? Comment: Two comments were received in response to this query, one in favor and one opposed to limiting the publication of the Sequence Listing to an electronic form for published PCT applications in the international phase. Response: At this time paper copies of the Sequence Listings filed as part of the description will continue to be published in applications filed under PCT. The PTO together with the EPO, JPO and WIPO will continue to discuss the possibility of electronic publication. However, any implementation of such electronic publication in lieu of publication in paper form will not be undertaken until further study has been completed. Comment: One comment suggested that informative English words be placed next to the numerical headings in the Sequence Listing as printed in a U.S. patent. Response: The PTO will provide English words corresponding to the numeric identifiers in the printed U.S. patents. Comment: One comment suggested addition of a descriptive comment line to the Sequence Listing. Response: The ``Other Information'' line in the Features section, which is numeric identifier <223> in Sec. 1.823, provides for a description of a sequence. While completion of this section is only mandatory when the sequence contains ``n'', ``Xaa'', a modified or unusual L-amino acid or a modified base, it is frequently completed in other circumstances. Comment: One comment requested we harmonize Secs. 1.821 through 1.825 with PCT, EPO and other authorities such that the differences in the requirements for Sequence Listing submissions are minimal. Response: This change to Secs. 1.821 through 1.825 is the result of such an effort to harmonize the PTO, PCT, EPO and JPO Sequence Listing requirements to the extent possible. The requirements of newly developed WIPO ST.25 are substantially identical to the requirements of amended Secs. 1.821 through 1.825. PatentIn Version 2.0 software, now available, is drafted to meet all of the requirements of WIPO Standard ST.25 (1998). The requirements of Secs. 1.821 through 1.825, however, are less stringent than the requirements of WIPO Standard ST.25 (1998). Thus, applicants who wish to file in countries which adhere to WIPO Standard ST.25 (1998) should consider the following when not using PatentIn Version 2.0: 1. The WIPO Standard ST.25 (1998) does not permit submissions using a Macintosh computer. 2. The WIPO Standard ST.25 (1998) does not accept the range of media permitted by amended Secs. 1.821 through 1.825. 3. The answers in field <221> and <222> must use selections from Tables 5 and 6 of WIPO Standard ST.25 (1998) to comply with that standard. The terms from these Tables are considered language neutral vocabulary. 4. Any free text in numeric identifier <223> of a Sequence Listing will not be translated and thus must also appear in the specification of applications filed under WIPO Standard ST.25 (1998) for compliance. 5. A CRF filed after the filing of an application under the PCT does not form part of the disclosure and will not be published in the pamphlet. 6. Paragraph 39 of WIPO Standard ST.25 (1998) requires the specific wording ``the information recorded on the form is identical to the written sequence listing.'' 7. WIPO Standard ST.25 (1998), paragraph 24, requires spaces between specified numeric identifiers in the Sequence Listing. Comment: One comment requested a WINDOWS based version of PatentIn. Response: A WINDOWS based version of PatentIn, PatentIn 2.0, has been developed through a Trilaterally-sponsored joint initiative and is being made available. Comment: One comment expressed concern over application of the doctrine of equivalents by the courts to sequence-based claim language. Response: Sections 1.821 through 1.825 do not establish a disclosure requirement, nor do they alter the requirements of 35 U.S.C. Sec. 112. They merely require a particular format whenever information is presented in the form of a sequence. The use of sequence identification numbers (SEQ ID NO: #) only provides a shorthand way for applicants to refer to sequence information. These identification numbers do not in any way restrict the manner in which an invention can be claimed. Similarly, the use of this format does not impact the potential interpretations and legal determinations that could be made with respect to claims containing information in the form of a nucleotide or amino acid sequence. Comment: One comment requested the flexibility to use single-letter amino acid codes. Response: Sections 1.821 through 1.825 as amended do not constrain an applicant from using single letter codes [[Page 29634]] in the disclosure. The requirements of the sequence searching and the sequence storage mechanisms include only the three-letter codes, thus the need for the constraint on the Sequence Listing information. There is no such restriction on the sequence format in the body of the disclosure or in the figures imposed by Secs. 1.821 through 1.825, or any of the rules of practice; only the format for the Sequence Listing is specified by Secs. 1.821 through 1.825. Review Under the Paperwork Reduction Act of 1995 Notwithstanding any other provision of law, no person is required to respond to nor shall a person be subject to a penalty for failure to comply with a collection of information subject to the requirements of the Paperwork Reduction Act (PRA) unless that collection of information displays a currently valid OMB control number. This rule contains collections of information requirements subject to the PRA. The principal impact of this Final Rule is: (1) Elimination of certain requirements of Secs. 1.821 through 1.825; and (2) revision of Secs. 1.821 through 1.825 for consistency with WIPO Standard ST.25 (1998), which will permit Sequence Listings to be presented in an international, language neutral format. The public reporting burden for these collections of information have been approved by the Office of Management and Budget (OMB) under OMB control number 0651-0024. The public reporting burden for this collection of information is estimated to average 80 minutes per response, including the time for reviewing instructions, searching existing data sources, gathering and maintaining the information. Send comments regarding this burden estimate or any other aspect of the data requirements, including suggestions for reducing this burden, to Esther M. Kepplinger at the address specified above or to the Office of Information and Regulatory Affairs of OMB, New Executive Office Bldg., 725 17th St. NW, rm. 10235, Washington, DC 20230, Attn: Desk Officer for the Patent and Trademark Office. Other Considerations This Final Rule is in conformity with the requirements of the Regulatory Flexibility Act (5 U.S.C. 601 et seq.), Executive Order 12612 (October 26, 1987), and the Paperwork Reduction Act of 1995 (44 U.S.C. 3501 et seq.). It has been determined that this rulemaking is not significant for the purposes of Executive Order 12866 (September 30, 1993). The Assistant General Counsel for Legislation and Regulation of the Department of Commerce has certified to the Chief Counsel for Advocacy, Small Business Administration that this Final Rule would not have a significant impact on a substantial number of small entities (Regulatory Flexibility Act, 5 U.S.C. 605(b)). The principal impact of this Final Rule is: (1) Elimination of certain requirements of Secs. 1.821 through 1.825; and (2) revision of Secs. 1.821 through 1.825 for consistency with WIPO Standard ST.25 (1998), which will permit Sequence Listings to be presented in an international, language neutral format. The Office has determined that this Final Rule has no Federalism implications affecting the relationship between the National Government and the States as outlined in Executive Order 12612. List of Subjects in 37 CFR Part 1 Administrative practice and procedure, Courts, Freedom of information, Inventions and patents, Incorporation by reference, Reporting and recordkeeping requirements, Small businesses. For the reasons set forth in the preamble and under the authority granted to the Commissioner of Patents and Trademarks by 35 U.S.C. 6, Title 37 of the Code of Federal Regulations, part 1, is amended as follows: PART 1--RULES OF PRACTICE IN PATENT CASES 1. The authority citation for 37 CFR part 1 continues to read as follows: Authority: 35 U.S.C. 6, unless otherwise noted. 2. Section 1.821 is revised to read as follows: Sec. 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and/or amino acid sequences as used in Secs. 1.821 through 1.825 are interpreted to mean an unbranched sequence of four or more amino acids or an unbranched sequence of ten or more nucleotides. Branched sequences are specifically excluded from this definition. Sequences with fewer than four specifically defined nucleotides or amino acids are specifically excluded from this section. ``Specifically defined'' means those amino acids other than ``Xaa'' and those nucleotide bases other than ``n'' defined in accordance with the World Intellectual Property Organization (WIPO) Handbook on Industrial Property Information and Documentation, Standard ST.25: Standard for the Presentation of Nucleotide and Amino Acid Sequence Listings in Patent Applications (1998), including Tables 1 through 6 in Appendix 2, herein incorporated by reference. (Hereinafter ``WIPO Standard ST.25 (1998)''). This incorporation by reference was approved by the Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51. Copies of WIPO Standard ST.25 (1998) may be obtained from the World Intellectual Property Organization; 34 chemin des Colombettes; 1211 Geneva 20 Switzerland. Copies of ST.25 may be inspected at the Patent Search Room; Crystal Plaza 3, Lobby Level; 2021 South Clark Place; Arlington, VA 22202. Copies may also be inspected at the Office of the Federal Register, 800 North Capitol Street, NW, Suite 700, Washington, DC. Nucleotides and amino acids are further defined as follows: (1) Nucleotides: Nucleotides are intended to embrace only those nucleotides that can be represented using the symbols set forth in WIPO Standard ST.25 (1998), Appendix 2, Table 1. Modifications, e.g., methylated bases, may be described as set forth in WIPO Standard ST.25 (1998), Appendix 2, Table 2, but shall not be shown explicitly in the nucleotide sequence. (2) Amino acids: Amino acids are those L-amino acids commonly found in naturally occurring proteins and are listed in WIPO Standard ST.25 (1998), Appendix 2, Table 3. Those amino acid sequences containing D- amino acids are not intended to be embraced by this definition. Any amino acid sequence that contains post-translationally modified amino acids may be described as the amino acid sequence that is initially translated using the symbols shown in WIPO Standard ST.25 (1998), Appendix 2, Table 3 with the modified positions; e.g., hydroxylations or glycosylations, being described as set forth in WIPO Standard ST.25 (1998), Appendix 2, Table 4, but these modifications shall not be shown explicitly in the amino acid sequence. Any peptide or protein that can be expressed as a sequence using the symbols in WIPO Standard ST.25 (1998), Appendix 2, Table 3 in conjunction with a description in the Feature section to describe, for example, modified linkages, cross links and end caps, non-peptidyl bonds, etc., is embraced by this definition. (b) Patent applications which contain disclosures of nucleotide and/or amino acid sequences, in accordance with the definition in paragraph (a) of this section, shall, with regard to the manner in which the nucleotide and/or amino [[Page 29635]] acid sequences are presented and described, conform exclusively to the requirements of Secs. 1.821 through 1.825. (c) Patent applications which contain disclosures of nucleotide and/or amino acid sequences must contain, as a separate part of the disclosure, a paper copy disclosing the nucleotide and/or amino acid sequences and associated information using the symbols and format in accordance with the requirements of Secs. 1.822 and 1.823. This paper copy is hereinafter referred to as the ``Sequence Listing.'' Each sequence disclosed must appear separately in the ``Sequence Listing.'' Each sequence set forth in the ``Sequence Listing'' shall be assigned a separate sequence identifier. The sequence identifiers shall begin with 1 and increase sequentially by integers. If no sequence is present for a sequence identifier, the code ``000'' shall be used in place of the sequence. The response for the numeric identifier <160> shall include the total number of SEQ ID NOs, whether followed by a sequence or by the code ``000.'' (d) Where the description or claims of a patent application discuss a sequence that is set forth in the ``Sequence Listing'' in accordance with paragraph (c) of this section, reference must be made to the sequence by use of the sequence identifier, preceded by ``SEQ ID NO:'' in the text of the description or claims, even if the sequence is also embedded in the text of the description or claims of the patent application. (e) A copy of the ``Sequence Listing'' referred to in paragraph (c) of this section must also be submitted in computer readable form in accordance with the requirements of Sec. 1.824. The computer readable form is a copy of the ``Sequence Listing'' and will not necessarily be retained as a part of the patent application file. If the computer readable form of a new application is to be identical with the computer readable form of another application of the applicant on file in the Patent and Trademark Office, reference may be made to the other application and computer readable form in lieu of filing a duplicate computer readable form in the new application if the computer readable form in the other application was compliant with all of the requirements of these rules. The new application shall be accompanied by a letter making such reference to the other application and computer readable form, both of which shall be completely identified. In the new application, applicant must also request the use of the compliant computer readable ``Sequence Listing'' that is already on file for the other application and must state that the paper copy of the ``Sequence Listing'' in the new application is identical to the computer readable copy filed for the other application. (f) In addition to the paper copy required by paragraph (c) of this section and the computer readable form required by paragraph (e) of this section, a statement that the content of the paper and computer readable copies are the same must be submitted with the computer readable form, e.g., a statement that ``the information recorded in computer readable form is identical to the written sequence listing.'' (g) If any of the requirements of paragraphs (b) through (f) of this section are not satisfied at the time of filing under 35 U.S.C. 111(a) or at the time of entering the national stage under 35 U.S.C. 371, applicant will be notified and given a period of time within which to comply with such requirements in order to prevent abandonment of the application. Any submission in reply to a requirement under this paragraph must be accompanied by a statement that the submission includes no new matter. (h) If any of the requirements of paragraphs (b) through (f) of this section are not satisfied at the time of filing an international application under the Patent Cooperation Treaty (PCT), which application is to be searched by the United States International Searching Authority or examined by the United States International Preliminary Examining Authority, applicant will be sent a notice necessitating compliance with the requirements within a prescribed time period. Any submission in reply to a requirement under this paragraph must be accompanied by a statement that the submission does not include matter which goes beyond the disclosure in the international application as filed. If applicant fails to timely provide the required computer readable form, the United States International Searching Authority shall search only to the extent that a meaningful search can be performed without the computer readable form and the United States International Preliminary Examining Authority shall examine only to the extent that a meaningful examination can be performed without the computer readable form. 3. Section 1.822 is revised to read as follows: Sec. 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall conform to the requirements of paragraphs (b) through (e) of this section. (b) The code for representing the nucleotide and/or amino acid sequence characters shall conform to the code set forth in the tables in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3. This incorporation by reference was approved by the Director of the Federal Register in accordance with 5 U.S.C. 552(a) and 1 CFR part 51. Copies of ST.25 may be obtained from the World Intellectual Property Organization; 34 chemin des Colombettes; 1211 Geneva 20 Switzerland. Copies of ST.25 may be inspected at the Patent Search Room; Crystal Plaza 3, Lobby Level; 2021 South Clark Place; Arlington, VA 22202. Copies may also be inspected at the Office of the Federal Register, 800 North Capitol Street, NW, Suite 700, Washington, DC. No code other than that specified in these sections shall be used in nucleotide and amino acid sequences. A modified base or modified or unusual amino acid may be presented in a given sequence as the corresponding unmodified base or amino acid if the modified base or modified or unusual amino acid is one of those listed in WIPO Standard ST.25 (1998), Appendix 2, Tables 2 and 4, and the modification is also set forth in the Feature section. Otherwise, each occurrence of a base or amino acid not appearing in WIPO Standard ST.25 (1998), Appendix 2, Tables 1 and 3, shall be listed in a given sequence as ``n'' or ``Xaa,'' respectively, with further information, as appropriate, given in the Feature section, preferably by including one or more feature keys listed in WIPO Standard ST.25 (1998), Appendix 2, Tables 5 and 6. (c) Format representation of nucleotides. (1) A nucleotide sequence shall be listed using the lower-case letter for representing the one- letter code for the nucleotide bases set forth in WIPO Standard ST.25 (1998), Appendix 2, Table 1. (2) The bases in a nucleotide sequence (including introns) shall be listed in groups of 10 bases except in the coding parts of the sequence. Leftover bases, fewer than 10 in number, at the end of noncoding parts of a sequence shall be grouped together and separated from adjacent groups of 10 or 3 bases by a space. (3) The bases in the coding parts of a nucleotide sequence shall be listed as triplets (codons). The amino acids corresponding to the codons in the coding parts of a nucleotide sequence shall be typed immediately below the corresponding codons. Where a codon spans an intron, the amino acid symbol [[Page 29636]] shall be typed below the portion of the codon containing two nucleotides. (4) A nucleotide sequence shall be listed with a maximum of 16 codons or 60 bases per line, with a space provided between each codon or group of 10 bases. (5) A nucleotide sequence shall be presented, only by a single strand, in the 5 to 3 direction, from left to right. (6) The enumeration of nucleotide bases shall start at the first base of the sequence with number 1. The enumeration shall be continuous through the whole sequence in the direction 5 to 3. The enumeration shall be marked in the right margin, next to the line containing the one-letter codes for the bases, and giving the number of the last base of that line. (7) For those nucleotide sequences that are circular in configuration, the enumeration method set forth in paragraph (c)(6) of this section remains applicable with the exception that the designation of the first base of the nucleotide sequence may be made at the option of the applicant. (d) Representation of amino acids. (1) The amino acids in a protein or peptide sequence shall be listed using the three-letter abbreviation with the first letter as an upper case character, as in WIPO Standard ST.25 (1998), Appendix 2, Table 3. (2) A protein or peptide sequence shall be listed with a maximum of 16 amino acids per line, with a space provided between each amino acid. (3) An amino acid sequence shall be presented in the amino to carboxy direction, from left to right, and the amino and carboxy groups shall not be presented in the sequence. (4) The enumeration of amino acids may start at the first amino acid of the first mature protein, with the number 1. When presented, the amino acids preceding the mature protein, e.g., pre-sequences, pro- sequences, pre-pro-sequences and signal sequences, shall have negative numbers, counting backwards starting with the amino acid next to number 1. Otherwise, the enumeration of amino acids shall start at the first amino acid at the amino terminal as number 1. It shall be marked below the sequence every 5 amino acids. The enumeration method for amino acid sequences that is set forth in this section remains applicable for amino acid sequences that are circular in configuration, with the exception that the designation of the first amino acid of the sequence may be made at the option of the applicant. (5) An amino acid sequence that contains internal terminator symbols (e.g., ``Ter'', ``*'', or ``.'', etc.) may not be represented as a single amino acid sequence, but shall be presented as separate amino acid sequences. (e) A sequence with a gap or gaps shall be presented as a plurality of separate sequences, with separate sequence identifiers, with the number of separate sequences being equal in number to the number of continuous strings of sequence data. A sequence that is made up of one or more noncontiguous segments of a larger sequence or segments from different sequences shall be presented as a separate sequence. 4. Section 1.823 is revised to read as follows: Sec. 1.823 Requirements for nucleotide and/or amino acid sequences as part of the application papers. (a) The ``Sequence Listing'' required by Sec. 1.821(c), setting forth the nucleotide and/or amino acid sequences and associated information in accordance with paragraph (b) of this section, must begin on a new page and must be titled ``Sequence Listing''. The ``Sequence Listing'' preferably should be numbered independently of the numbering of the remainder of the application. Each page of the ``Sequence Listing'' should contain no more than 66 lines and each line should contain no more than 72 characters. A fixed-width font should be used exclusively throughout the ``Sequence Listing.'' (b) The ``Sequence Listing'' shall, except as otherwise indicated, include the actual nucleotide and/or amino acid sequence, the numeric identifiers and their accompanying information as shown in the following table. The numeric identifier shall be used only in the ``Sequence Listing.'' The order and presentation of the items of information in the ``Sequence Listing'' shall conform to the arrangement given below. Each item of information shall begin on a new line and shall begin with the numeric identifier enclosed in angle brackets as shown. The submission of those items of information designated with an ``M'' is mandatory. The submission of those items of information designated with an ``O'' is optional. Numeric identifiers <110> through <170> shall only be set forth at the beginning of the ``Sequence Listing.'' The following table illustrates the numeric identifiers. ---------------------------------------------------------------------------------------------------------------- Mandatory (M) or optional Numeric identifier Definition Comments and format (O). ---------------------------------------------------------------------------------------------------------------- <110>.................... Applicant............... Preferably max. of 10 names; M. one name per line; preferable format: Surname, Other Names and/or Initials. <120>.................... Title of Invention...... ............................. M. <130>.................... File Reference.......... Personal file reference...... M when filed prior to assignment of appl. number. <140>.................... Current Application Specify as: US 07/999,999 or M, if available. Number. PCT/US96/99999. <141>.................... Current Filing Date..... Specify as: yyyy-mm-dd....... M, if available. <150>.................... Prior Application Number Specify as: US 07/999,999 or M, if applicable include PCT/US96/99999. priority documents under 35 USC 119 and 120. <151>.................... Prior Application Filing Specify as: yyyy-mm-dd....... M, if applicable. Date. <160>.................... Number of SEQ ID NOs.... Count includes total number M. of SEQ ID NOs. <170>.................... Software................ Name of software used to O. create the Sequence Listing. <210>.................... SEQ ID NO:#:............ Response shall be an integer M. representing the SEQ ID NO shown. <211>.................... Length.................. Respond with an integer M. expressing the number of bases or amino acid residues. [[Page 29637]] <212>.................... Type.................... Whether presented sequence M. molecule is DNA, RNA, or PRT (protein). If a nucleotide sequence contains both DNA and RNA fragments, the type shall be ``DNA.'' In addition, the combined DNA/ RNA molecule shall be further described in the <220> to <223> feature section. <213>.................... Organism................ Scientific name, i.e. Genus/ M species, Unknown or Artificial Sequence. In addition, the ``Unknown'' or ``Artificial Sequence'' organisms shall be further described in the <220> to <223> feature section. <220>.................... Feature................. Leave blank after <220>. <221- M, under the following 223> provide for a conditions: if ``n,'' description of points of ``Xaa,'' or a modified or biological significance in unusual L-amino acid or the sequence.. modified base was used in a sequence; if ORGANISM is ``Artificial Sequence'' or ``Unknown'; if molecule is combined DNA/RNA'' <221>.................... Name/Key................ Provide appropriate M, under the following identifier for feature, conditions: if ``n,'' preferably from WIPO ``Xaa,'' or a modified or Standard ST.25 (1998), unusual L-amino acid or Appendix 2, Tables 5 and 6. modified base was used in a sequence. <222>.................... Location................ Specify location within M, under the following sequence; where appropriate conditions: if ``n,'' state number of first and ``Xaa,'' or a modified or last bases/amino acids in unusual L-amino acid or feature. modified base was used in a sequence. <223>.................... Other Information....... Other relevant information; M, under the following four lines maximum. conditions: if ``n,'' ``Xaa,'' or a modified or unusual L-amino acid or modified base was used in a sequence; if ORGANISM is ``Artificial Sequence'' or ``Unknown''; if molecule is combined DNA/RNA. <300>.................... Publication Information. Leave blank after <300>...... O. <301>.................... Authors................. Preferably max of ten named O. authors of publication; specify one name per line; preferable format: Surname, Other Names and/or Initials. <302>.................... Title................... ............................. O. <303>.................... Journal................. ............................. O. <304>.................... Volume.................. ............................. O. <305>.................... Issue................... ............................. O. <306>.................... Pages................... ............................. O. <307>.................... Date.................... Journal date on which data O. published; specify as yyyy- mm-dd, MMM-yyyy or Season- yyyy. <308>.................... Database Accession Accession number assigned by O. Number. database including database name. <309>.................... Database Entry Date..... Date of entry in database; O. specify as yyyy-mm-dd or MMM- yyyy. <310>.................... Patent Document Number.. Document number; for patent- O. type citations only. Specify as, for example, US 07/ 999,999. <311>.................... Patent Filing Date...... Document filing date, for O. patent-type citations only; specify as yyyy-mm-dd. <312>.................... Publication Date........ Document publication date, O. for patent-type citations only; specify as yyyy-mm-dd. <313>.................... Relevant Residues....... FROM (position) TO (position) O. <400>.................... Sequence................ SEQ ID NO should follow the M. numeric identifier and should appear on the line preceding the actual sequence. ---------------------------------------------------------------------------------------------------------------- 5. Section 1.824 is revised to read as follows: Sec. 1.824 Form and format for nucleotide and/or amino acid sequence submissions in computer readable form. (a) The computer readable form required by Sec. 1.821(e) shall meet the following specifications: (1) The computer readable form shall contain a single ``Sequence Listing'' as either a diskette, series of diskettes, or other permissible media outlined in paragraph (c) of this section. (2) The ``Sequence Listing'' in paragraph (a) (l) of this section shall be submitted in American Standard Code for Information Interchange (ASCII) text. No other formats shall be allowed. (3) The computer readable form may be created by any means, such as word processors, nucleotide/amino acid sequence editors or other custom computer programs; however, it shall conform to all specifications detailed in this section. (4) File compression is acceptable when using diskette media, so long as the compressed file is in a self-extracting format that will decompress on one of the systems described in paragraph (b) of this section. (5) Page numbering shall not appear within the computer readable form version of the ``Sequence Listing'' file. (6) All computer readable forms shall have a label permanently affixed thereto on which has been hand-printed or typed: the name of the applicant, the title of the invention, the date on which the data were recorded on the computer readable form, the operating system used, a reference number, and an application serial number and filing date, if known. [[Page 29638]] (b) Computer readable form submissions must meet these format requirements: (1) Computer: IBM PC/XT/AT, or compatibles, or Apple Macintosh; (2) Operating System: MS-DOS, Unix or Macintosh; (3) Line Terminator: ASCII Carriage Return plus ASCII Line Feed; (4) Pagination: Continuous file (no ``hard page break'' codes permitted); (c) Computer readable form files submitted may be in any of the following media: (1) Diskette : 3.50 inch, 1.44 Mb storage; 3.50 inch, 720 Kb storage; 5.25 inch, 1.2 Mb storage; 5.25 inch, 360 Kb storage. (2) Magnetic tape: 0.5 inch, up to 24000 feet; Density: 1600 or 6250 bits per inch, 9 track; Format: Unix tar command; specify blocking factor (not ``block size''); Line Terminator: ASCII Carriage Return plus ASCII Line Feed. (3) 8mm Data Cartridge: Format: Unix tar command; specify blocking factor (not ``block size''); Line Terminator: ASCII Carriage Return plus ASCII Line Feed. (4) CD-ROM: Format: ISO 9660 or High Sierra Format (5) Magneto Optical Disk: Size/Storage Specifications: 5.25 inch, 640 Mb. (d) Computer readable forms that are submitted to the Office will not be returned to the applicant. 6. Section 1.825 is revised to read as follows: Sec. 1.825 Amendments to or replacement of sequence listing and computer readable copy thereof. (a) Any amendment to the paper copy of the ``Sequence Listing'' (Sec. 1.821(c)) must be made by the submission of substitute sheets. Amendments must be accompanied by a statement that indicates support for the amendment in the application, as filed, and a statement that the substitute sheets include no new matter. (b) Any amendment to the paper copy of the ``Sequence Listing,'' in accordance with paragraph (a) of this section, must be accompanied by a substitute copy of the computer readable form (Sec. 1.821(e)) including all previously submitted data with the amendment incorporated therein, accompanied by a statement that the copy in computer readable form is the same as the substitute copy of the ``Sequence Listing.'' (c) Any appropriate amendments to the ``Sequence Listing'' in a patent; e.g., by reason of reissue or certificate of correction, must comply with the requirements of paragraphs (a) and (b) of this section. (d) If, upon receipt, the computer readable form is found to be damaged or unreadable, applicant must provide, within such time as set by the Commissioner, a substitute copy of the data in computer readable form accompanied by a statement that the substitute data is identical to that originally filed. 7. Appendix A To Subpart G to Part 1 is revised to read as follows: BILLING CODE 3510-16-P [[Page 29639]] Appendix A To Subpart G to Part 1--Sample Sequence Listing [GRAPHIC] [TIFF OMITTED] TR01JN98.006 [[Page 29640]] [GRAPHIC] [TIFF OMITTED] TR01JN98.007 [[Page 29641]] [GRAPHIC] [TIFF OMITTED] TR01JN98.008 [[Page 29642]] [GRAPHIC] [TIFF OMITTED] TR01JN98.009 [[Page 29643]] [GRAPHIC] [TIFF OMITTED] TR01JN98.010 Dated: May 22, 1998. Bruce A. Lehman, Assistant Secretary of Commerce and Commissioner of Patents and Trademarks. [FR Doc. 98-14194 Filed 5-29-98; 8:45 am] BILLING CODE 3510-16-C