101
0/304
Loading content...
A genomics analytics team stores raw sequence reads and needs a fast pattern-audit view for downstream quality workflows.
Table: Samples
Compute four case-sensitive diagnostic flags for every row:
Task: Return every sample row with these four flags.
Output requirements:
Supported submission environments:
Samples:
| sample_id | dna_sequence | species |
|-----------|------------------|-----------|
| 1 | ATGCCCTAA | Human |
| 2 | GGGATCCTTC | Human |
| 3 | TATATCGCATAG | Mouse |
| 4 | ATGATATGGGCTA | Zebrafish |[
{"sample_id":1,"dna_sequence":"ATGCCCTAA","species":"Human","has_start":1,"has_stop":1,"has_atat":0,"has_ggg":0},
{"sample_id":2,"dna_sequence":"GGGATCCTTC","species":"Human","has_start":0,"has_stop":0,"has_atat":0,"has_ggg":1},
{"sample_id":3,"dna_sequence":"TATATCGCATAG","species":"Mouse","has_start":0,"has_stop":1,"has_atat":1,"has_ggg":0},
{"sample_id":4,"dna_sequence":"ATGATATGGGCTA","species":"Zebrafish","has_start":1,"has_stop":0,"has_atat":1,"has_ggg":1}
]Each output flag is computed independently from the same sequence, then rows are ordered by sample_id.
Samples:
| sample_id | dna_sequence | species |
|-----------|----------------|---------|
| 10 | atgCCCTAA | Human |
| 11 | ATGCCCTaa | Mouse |
| 12 | ATATGGGTTTAG | Human |
| 13 | CCGTGCGTACCC | Rat |[
{"sample_id":10,"dna_sequence":"atgCCCTAA","species":"Human","has_start":0,"has_stop":1,"has_atat":0,"has_ggg":0},
{"sample_id":11,"dna_sequence":"ATGCCCTaa","species":"Mouse","has_start":1,"has_stop":0,"has_atat":0,"has_ggg":0},
{"sample_id":12,"dna_sequence":"ATATGGGTTTAG","species":"Human","has_start":0,"has_stop":1,"has_atat":1,"has_ggg":1},
{"sample_id":13,"dna_sequence":"CCGTGCGTACCC","species":"Rat","has_start":0,"has_stop":0,"has_atat":0,"has_ggg":0}
]Pattern checks are case-sensitive. For example, lowercase 'atg' does not satisfy the start rule.
Samples:
| sample_id | dna_sequence | species |
|-----------|--------------|-------------|
| 20 | ATG | Human |
| 21 | TAA | Human |
| 22 | ATAT | Mouse |
| 23 | GGG | Zebrafish |
| 24 | ATGGGGTGA | Arabidopsis |[
{"sample_id":20,"dna_sequence":"ATG","species":"Human","has_start":1,"has_stop":0,"has_atat":0,"has_ggg":0},
{"sample_id":21,"dna_sequence":"TAA","species":"Human","has_start":0,"has_stop":1,"has_atat":0,"has_ggg":0},
{"sample_id":22,"dna_sequence":"ATAT","species":"Mouse","has_start":0,"has_stop":0,"has_atat":1,"has_ggg":0},
{"sample_id":23,"dna_sequence":"GGG","species":"Zebrafish","has_start":0,"has_stop":0,"has_atat":0,"has_ggg":1},
{"sample_id":24,"dna_sequence":"ATGGGGTGA","species":"Arabidopsis","has_start":1,"has_stop":1,"has_atat":0,"has_ggg":1}
]Short sequences are valid inputs. Each rule applies directly without requiring joins or grouping.
Constraints