| HTML Guide: A Complete MarkUp Set | cern archive | HTTP only |
| home archive home about | |||||||||||||
A Complete Set of ConstructsThe recommended usage is incomplete; it only includes those constructs that are easy to implement and explain. This section discusses a few more constructs that allow you to do anything that can legally be done. There are constructs beyond these, but they can all be reduced to constructs shown here.
Document StructureAn HTML document is a header part followed by a BODY element.The header part consists of the TITLE, ISINDEX, and NEXTID elements which each appear zero or one time in any order. (see ISINDEX test, no title test) The BODY start and end tags may be omitted. They will be inferred by SGML parsers. "Recommended Usage" is an example of this. This entity is an example of explicitly including the BODY tags. The PLAINTEXT tag signals the end of the HTML text entity, and the beginning of a non-SGML data entity. (The format of the data is governed by the MIME text/plain content type.) See Also:
Header ElementsTITLEThe title can have an '<' character, as long as it's not followed by a '/' and a letter. See the section on SGML delimiters in CDATA.Body ElementsThe normal text content of body elements may include several kinds of markup.A comment that you shouldn't see: For copyrights, RCS keywords, etc. processing instruction: If you've _got_ to stick TeX macros or something in there, use this. The sample implementation won't even tell you it's there, though.
Entity ReferencesEntity references are recognized in normal body elements (anyplace #PCDATA appears in the DTD) and attribute value literals. See the Entities section of "Text and Markup" for more details. The HTML DTD defines the following entities for characters that might otherwise be parsed as markup:
HTML Entities
ISO Latin-1 CharactersThe HTML DTD references the public text "ISO 8879:1986//ENTITIES Added Latin 1//EN" to define entities for latin-1 characters, for example Gödel was a famous mathemetician.AnchorsOrder and Apperance of Attributesname implied
Quotes In Attribute ValuesIn order to include quotes in the value of the content-type attribute, use """ and "'" entity references: link to SGMLS software distribution with fancy content-type attributeNote: Interpretation of LiteralsSection 7.9.3 of the SGML standard states
Since to date there are no HTML attributes containing newlines or spaces, that is not much of an issue. @@But replacement of literals is. For one thing, this creates an interaction between the syntax of URLs and SGML syntax. We could resolve this issue by removing '&' from the URL syntax .
HeadingsSix levels of headings are defined:
Level four headingAnother level four heading. It's long. It's only conventional and suggested that lines be less than 72 characters long. It's certainly not specified, defined, or required.Level five headingLevel six headingParagraphsNormal paragraphs consist of text consisting of words, sentences, and other stuff. Line breaks are not significant. This is still the first paragraph of this section.Here's the second paragraph. It's long. It's only conventional and suggested that lines be less than 72 characters long. It's certainly not specified, defined, or required. A P tag isn't needed between a paragraph and some other element, like a heading. Ordered listsThese are for things like lists of steps, where the order is significant.
Case of names is not significant: different casesCase of names is not significant: both lower caseTYPEWRITER
Literal Text ElementsTabs in XMP content: |
|
||||||||||||