Documentation Index
Fetch the complete documentation index at: https://docs.semilattice.ai/llms.txt
Use this file to discover all available pages before exploring further.

Requirements
Populations require tabular, individual-level question-answer pair data from the group of people the population aims to model. For example: data from a survey, poll, questionnaire, or onboarding form where each row is an individual’s answers to a set of questions.Question types
The API supports three question types in seed datasets:- single-choice: individuals choose a single answer from a finite list of options.
- multiple-choice: individuals choose multiple answers from a finite list of options, sometimes with a limit on the number of options they can choose.
- open-ended: individuals answer in their own words.
Technical requirements
The API accepts.csv files which meet the following requirements:
- the first column must be titled sim_id
- the sim_id column should contain a unique number in each row
- there should be between 4 and 999 question columns
- questions must be between 3 and 999 characters long
- there should be between 1 and 99,999 respondent rows
- answer cells can be empty
- non-empty answers must be no more than 999 characters long
- answers cannot contain newline characters
- there must be at least one single-choice or multiple-choice question column in your seed data file because open-ended questions cannot be tested
- for single-choice and multiple-choice questions, there should be between 1 and 500 unique answer choices
- non-empty answer cells for single-choice and open-ended questions must be valid strings representing the respondent’s answer e.g. “Somewhat Agree” or “I usually go to the beach”
- non-empty answer cells for multiple-choice questions must be valid lists of quoted strings representing the respondent’s set of answer choices e.g. ”[‘Chocolate’, ‘Vanilla’, ‘Lemon sorbet’]”
Example
This very small CSV is valid, containing answers to 4 questions from 10 respondents. The first two question columns are single-choice questions, the third is multiple-choice, and the fourth is open-ended.Qualitative requirements & guidelines
The API doesn’t check if your seed data follows these requirements and guidelines, but they are essential to creating effective populations.Column headers should be questions or propositions
Column headers should be questions or propositions
Questions cannot assume knowledge of other questions
Questions cannot assume knowledge of other questions
Answer options cannot refer to other answer options
Answer options cannot refer to other answer options
Questions and answer options must be human-readable
Questions and answer options must be human-readable
.csv, so questions and answers have to be decoded into human-readable values.All questions and answers should come from the same source such as a single survey, questionnaire, or onboarding form
All questions and answers should come from the same source such as a single survey, questionnaire, or onboarding form
Answers in a row should all come from the same person
Answers in a row should all come from the same person
Good to know
These considerations are useful to bear in mind.Other types of questions can be reformatted to be compatible, with caution
Other types of questions can be reformatted to be compatible, with caution
Question order doesn't matter
Question order doesn't matter
Answer option order is always random
Answer option order is always random
Bigger datasets can be filtered, with caution
Bigger datasets can be filtered, with caution