Join us in our mission to help users become productive and confident using Soda software.
The following outlines the workflow to contribute.
- Set up docs tooling locally on your machine and clone the GitHub repo.
- Create a new branch for your work. Include the word
docs
in the name of your branch. - Follow the style guidelines to edit existing or write new content using markdown.
- Adjust the nav order of any new files in the
docs
>nav.yml
file. - Spell check your content (select all and copy to Google Docs for a thorough check) and test all links.
- Commit your contributions, create a pull request, and request a review, if you wish.
- When all tests pass, merge your pull request.
- Celebrate your new life as a published author!
Soda uses the following tools to build and publish documentation.
- GitHub to store content
- Jekyll to build and serve content
- Just the Docs to apply a visual theme
- lunrjs to facilitate searches in docs (search bar)
To contribute to Soda documentation, set up your local system to author and preview content before committing it.
- Jekyll requires Ruby 2.4.0 or higher. If necessary, upgrade or install Ruby locally.
- Install two Ruby gems: bundler and jekyll. Refer to Jekyll documentation for details.
$ gem install --user-install bundler jekyll
- Clone the sodadata/docs repo from GitHub.
- From the command-line, navigate to the
docs
repo directory, then runbundle install
to install all the necessary gems. - Open the cloned repo locally in your favorite code editor such as Visual Code or Sublime.
- From the command-line, navigate to the
docs
repo directory, then run the following to build and serve docs locally.
$ bundle exec jekyll serve
- In a browser, navigate to http://localhost:4000/soda-sql/ to see a preview of the docs site locally. Make changes to docs files in your code editor, save the files, then refresh your browser to preview your changes.
- If you are creating a new page, consider copy+pasting the contents of the
template-new-page.md
file and pasting into your new file so that the standard header and footer info are included.
Soda uses the Splunk Style Guide for writing documentation. For any questions about style or language that are not listed below, refer to the Splunk Style Guide for guidance.
- Use American English.
- Use plain language. Do not use jargon, colloquialisms, or meme references.
- Use unbiased language. For example, instead of "whitelist" and "blacklist", use "passlist" and "denylist".
- Avoid writing non-essential content such as welcome messages or backstory.
- When referring to a collection of files, use "directory", not "folder".
- Use "you" and "your" to refer to the reader.
- Do not refer to Soda, the company, as participants in documentation: "we", "us", "let's", "our".
- Use active voice.
- Use present tense and imperative mood. See the Set up docs tooling section above for an example.
- Avoid the subjunctive mood: "should", "would", "could".
- Make the language of your lists parallel.
- Never write an FAQ page or section. FAQs are a randomly organized bucket of content that put the burden on the reader to find what they need. Instead, consciously think about when and where a user needs the information and include it there.
- Include code snippets and commands.
- Limit inclusion of screencaps. These images are hard to keep up-to-date as the product evolves.
- Include diagrams.
- Do not use "Note:" sections. Exception: to indicate incompatibility or a known issue.
- Use _includes rather than repeat or re-explain something.
- Use bold for the first time you mention a product name or feature in a document or to identify a Tip: for using a feature. See [Warehouse YAML]({% link soda-sql/warehouse.md %}) for an example. Otherwise, use it sparingly. Too much bold font renders the format meaningless.
- Use italics sparingly for emphasis, primarily on the negative. For example, "Limit the scan to only test data from today."
- Do not use underline.
- Use ALL CAPS only for severe warnings. For example, "DO NOT store sensitive information."
- Use sentence case for all titles and headings.
- Use H1 headings for the page title. Use H2 and H3 as subheadings. Use H4 headings to introduce example code snippets.
- Never stack headings with no content between them. Add content or remove a heading, ideally the latter so as to avoid adding non-essential text.
- Use bulleted lists for non-linear lists.
- Use numbered lists for procedures or ordered tasks.
- Use relative links to link to other files or sections in Soda documentation.
- Use hard-coded links to link to external sources. Ensure the external link opens a new tab in a reader's browser.
- Liberally include links to the [Glossary]({% link soda/glossary.md %}), but only link the first instance of a term on a page, not all instances.
- Categorize your new content according to the following macro groups:
- Concepts - content that explains in general, without including procedural steps. Characterized by a title that does not use present tense imperative such as, "How Soda SQL works" or "Metrics".
- Tasks - content that describes the steps a user takes to complete a task or reach a goal. Characterized by a title that is in present tense imperative such as, "Install Soda SQL" or "Apply filters".
- Reference - content that presents lists or tables of reference material such as error codes or glossary.
- Produce content that focuses on how to achieve a goal or solve a problem and, insofar as it is practical, is inclusive of all products. Avoid creating documentation that focuses on how to use a single product. For example, instead of writing two documents -- one for "Troubleshoot Soda SQL" and one for "Troubleshoot Soda Cloud" -- write one Troubleshoot document that offers guidance for both tools.
- Remember that Every Page is Page One for your reader. Most people enter the docs by clicking on the result of a Google search, so they could land anywhere and you should assume your new page is the first page that a reader lands on. Give them the context for what they are reading, lots of "escape hatches" to the glossary or pre-requisite procedures, and instructions on what to read next in a "Go further" section at the bottom of all Concept or Task pages.
- Apply a grey, 1px border around screencap images embedded in pages; diagrams and graphics do not require borders.
- Narrow and tighten any the UI before you take a screencap so as to keep the image legible when embedded to a page.
- Ensure all images is legible.
- Carefully consider whether or not to include a screencap. Is it really necessary? Does it communicate more than just your words do? Screencaps of UI require regular maintenance to keep up with changes in the UI; is the image you are adding worth the extra maintenance?
Document the details of each Soda product's release using included release note files.
- Create a new branch in this Docs repo to create your product release notes.
- To the
_release-notes
folder in this Docs repo, add a new markdown file for your new product release notes. Follow the file naming structure established by existing files. - Copy and paste the contents of the
template-soda-cloud-rn.md
file or thetemplate-soda-sql-rn.md
file into your new file. - Write your release notes using the style guide for guidance on format, language, etc.
- Spell check your content. (Copy paste to a Google Doc or your favorite editor to run a spell check.)
- Commit your changes, then create a new pull request when you are ready to publish.
- In the new pull request, request a review by @janet-can if you wish, or simply squash and merge to publish.
Kramdown is the default markdown renderer that Jekyll uses. What follows are some Kramdown-specific formatting syntax.
Insert image:
Add a png
file of your logically-named image to the docs/assets/images
directory, then add this markdown to the file in which you want the image to appear:
![scan-anatomy](/assets/images/scan-anatomy.png){:height="440px" width="440px"}
Includes:
Add a markdown file of your logically-named include content to the docs/_includes
directory, then add this markdown to the file in which you want the content to appear:
{% include run-a-scan.md %}
Relative links:
[warehouse yaml]({% link soda-sql/warehouse.md %})
[airflow_bash.py](/../examples/airflow_bash.py)
Link to an anchor on a different page:
[warehouse yaml]({% link soda-sql/warehouse.md %}#to-anchor)
Link to section on same page:
[example](#example-tests-using-a-column-metric)
External link:
<a href="https://en.wikipedia.org" target="_blank">Wikipedia</a>
Comment out:
<!-- This content does not display on the web page. -->
Show code: To keep the numbered list intact, apply endraw to the end of the line of preceding text; do not prepend with a line break.
{% raw %}
{% endraw %}
Add Last modified date:
{% last_modified_at %}
Add collapse-expand toggle:
<details>
<summary>Click to expand</summary>
Long content here
and here
</details>
Add mailto link:
<a href="mailto:support@soda.io">Soda Support</a>
The jekyll-redirect-from
plugin enables authors to redirect users if a page is moved.
See documentation: https://github.com/jekyll/jekyll-redirect-from
To apply a redirect, navigate to the redirect destination file, then add the following to the file metadata at the top:
redirect_from: /pageyouwanttoredirect/
OR
redirect_from:
- /pageyouwantoredirect/
- /anotherpagetoredirect/
docs.soda.io uses the Just The Docs for the theme which, in turn, uses lunrjs for the Search tool. Per lunrjs's documentation, “the more a search term occurs in a single document, the more that term will increase that document’s score, but the more a search term occurs in the overall collection of documents, the less that term will increase a document’s score.” There is no built-in functionality in the theme to adjust the document’s scoring relative to search terms.
Work-around: artificially add a string/search term to to a document to boost the document's search score and make it appear in the auto-complete search results.
Also, lunrjs does not take into account ocurrences of terms that appear in _includes. For the Search tool to "find" a term, the term itself must occur in the content of a served markdown file. For example, though the term regex appeared multiple times in two _includes files, the Search tool registered zero occurrences of the term in the docs. Adding the term to the content of the metrics.md and sql_metrics.md files yielded expected search results.