---
library_name: sklearn
license: mit
tags:
- sklearn
- skops
- tabular-classification
model_format: pickle
model_file: model_fold_2.pkl
widget:
- structuredData:
    x0:
    - 0.0
    - 0.9278350515463918
    - 0.0
    x1:
    - 59199.0
    - 7.0
    - 744.0
    x2:
    - 59047.0
    - 0.0
    - 790.0
    x3:
    - 0.0
    - 28.0
    - 8.0
    x4:
    - 0.0
    - 1.0
    - 1.0
    x5:
    - 1.0
    - 1.0
    - 1.0
---

# Model description

<details>
<summary> Click to expand </summary>

This is a random forest classifier that predicts whether a given pair of transcripts comes from the same gene. It is trained on a dataset derived from Ensembl transcripts and is being evaluated on transcripts from the same sources and from others, such as FlyBase.

</details>

# How to Get Started with the Model

[More Information Needed]
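
A minimal sketch of loading the pickled model and classifying transcript pairs is shown below. It assumes that `model_fold_2.pkl` (the `model_file` named in the card metadata) has been downloaded to the working directory, that it unpickles to a fitted scikit-learn classifier, and that the six features are passed in the order `x0` to `x5`. The helper names `load_model` and `predict_pairs` are hypothetical and not part of any released package. The example rows are taken from the widget data in the card metadata.

```python
import pickle
from pathlib import Path

import numpy as np

# Assumption: features are ordered x0..x5, as in the widget metadata.
N_FEATURES = 6

# The three example pairs from the widget block, one row per pair.
EXAMPLE_ROWS = [
    [0.0, 59199.0, 59047.0, 0.0, 0.0, 1.0],
    [0.9278350515463918, 7.0, 0.0, 28.0, 1.0, 1.0],
    [0.0, 744.0, 790.0, 8.0, 1.0, 1.0],
]


def load_model(path):
    """Unpickle a fitted estimator (hypothetical helper).

    Pickle can execute arbitrary code on load, so only open files you trust.
    """
    with Path(path).open("rb") as handle:
        return pickle.load(handle)


def predict_pairs(model, rows):
    """Return one predicted class label per transcript pair (hypothetical helper)."""
    features = np.asarray(rows, dtype=float)
    if features.ndim != 2 or features.shape[1] != N_FEATURES:
        raise ValueError(f"expected rows with {N_FEATURES} features each")
    return model.predict(features)


# Assumed local path; the prediction only runs if the file is present.
MODEL_PATH = Path("model_fold_2.pkl")
if MODEL_PATH.exists():
    model = load_model(MODEL_PATH)
    print(predict_pairs(model, EXAMPLE_ROWS))
```

The scikit-learn version used for loading should match the one used for training as closely as possible, since pickled estimators are not guaranteed to load across versions.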

# Model Card Authors

<details>
<summary> Click to expand </summary>

Andrew Green (afg1)

</details>

# Model Card Contact

You can contact the model card authors through the following channels:
[More Information Needed]

# Citation

Below you can find information related to citation.

**BibTeX:**
```
[More Information Needed]
```

# Intended uses & limitations

<details>
<summary> Click to expand </summary>

This model is experimental, and is undergoing further testing.

</details>

# Five-fold cross validation

We test the model on a random subset of the transcript pairs processed from all our coordinate data. The metrics below measure performance on the binary classification task: do these two transcripts come from the same gene?

|   fold |   balanced_acc |       F1 |      auc |       ap |
|--------|----------------|----------|----------|----------|
|      0 |       0.970089 | 0.989711 | 0.995063 | 0.998216 |
|      1 |       0.968053 | 0.98935  | 0.994941 | 0.998181 |
|      2 |       0.970278 | 0.989625 | 0.995177 | 0.998239 |
|      3 |       0.968382 | 0.989364 | 0.994861 | 0.998184 |
|      4 |       0.968858 | 0.989405 | 0.994907 | 0.997969 |
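
For readers who want to reproduce this kind of table on their own data, the sketch below shows one way per-fold metrics like these could be computed with scikit-learn. It is not the training script used for this model: the stratified shuffled split, the forest hyperparameters, and the synthetic data from `make_classification` are all assumptions, and it further assumes that `auc` means ROC AUC and `ap` means average precision. The function name `cross_validate_pairs` is hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import (
    average_precision_score,
    balanced_accuracy_score,
    f1_score,
    roc_auc_score,
)
from sklearn.model_selection import StratifiedKFold


def cross_validate_pairs(X, y, n_splits=5, seed=0):
    """Fit one forest per fold and collect four metrics (hypothetical helper)."""
    # Assumption: stratified, shuffled folds; the card does not state the split.
    splitter = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    rows = []
    for fold, (train_idx, test_idx) in enumerate(splitter.split(X, y)):
        # Assumption: hyperparameters chosen for the sketch only.
        model = RandomForestClassifier(n_estimators=50, random_state=seed)
        model.fit(X[train_idx], y[train_idx])
        labels = model.predict(X[test_idx])
        # Probability of the positive class, used by the ranking metrics.
        scores = model.predict_proba(X[test_idx])[:, 1]
        rows.append(
            {
                "fold": fold,
                "balanced_acc": balanced_accuracy_score(y[test_idx], labels),
                "F1": f1_score(y[test_idx], labels),
                "auc": roc_auc_score(y[test_idx], scores),
                "ap": average_precision_score(y[test_idx], scores),
            }
        )
    return rows


# Synthetic stand-in for the six transcript-pair features.
X, y = make_classification(n_samples=500, n_features=6, random_state=0)
for row in cross_validate_pairs(X, y):
    print(row)
```

Balanced accuracy and F1 are computed from hard class labels, while ROC AUC and average precision use the predicted probability of the positive class.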