Skip to main content

Table 1 Sampling fraction and sub-network size. In the present context, the true network has been taken to be the available PIN dataset (which contains itself interactions among 4773 out of an estimated 6000 S. cerevisiae proteins). The relationship between sampling fraction p and number of edges in the subnet is quadratic M S = p 2 M N MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBamrtHrhAL1wy0L2yHvtyaeHbnfgDOvwBHrxAJfwnaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaWaaeGaeaaakeaacqWGnbqtdaWgaaWcbaacdaGae8NeXpfabeaakiabg2da9iabdchaWnaaCaaaleqabaGaeGOmaidaaOGaemyta00aaSbaaSqaaiab=1q8obqabaaaaa@4056@ . The last line shows the extrapolation from the present network to the true network size assuming random sampling.

From: The effects of incomplete protein interaction data on structural and evolutionary inferences

Sampling fraction

Number of proteins

Mean number of interactions

0.2

955

602

0.4

1907

2423

0.6

2864

5465

0.8

3819

9716

1.0

4773

15181

Full network

≈6000

≈23700