https://github.com/arturogonzalezm/filling_data_python

Host: GitHub
URL: https://github.com/arturogonzalezm/filling_data_python
Owner: arturogonzalezm
License: mit
Created: 2024-04-11T07:02:50.000Z (over 1 year ago)
Default Branch: master
Last Pushed: 2024-04-11T07:21:03.000Z (over 1 year ago)
Last Synced: 2025-01-02T08:14:40.318Z (6 months ago)
Language: Python
Size: 6.84 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

# Mercury Level Estimation Tool

This Python script is designed to estimate missing daily mercury level readings in a river. The data consists of timestamps and mercury levels, with some levels missing and marked accordingly. The script uses linear interpolation to estimate these missing values.

## Instructions
A time series of daily readings of mercury levels in a river is provided to you.
In each test case, the day's highest level is missing for certain days.
By analysing the data, try to identify the missing mercury levels for those days.
Each row of data contains two tab-separated values: a time-stamp and the day's highest reading.

There are exactly twenty rows marked missing in each input file.
The missing values are marked as "Missing_1", "Missing_2", ..., "Missing_20".
These missing records have been randomly dispersed in the rows of data.

Complete the `calcMissing(readings)` function in the editor below.
It should print 20 rows, one for each missing value, as Python floats.

Mercury levels are all < 400. If the missing value is not found, print "Missing".

## Functionality

The core functionality is encapsulated in the `calcMissing(readings)` function. Here's a breakdown of how it works:

### Input Parsing

- The function takes a list of `readings`, where each reading is a string containing a timestamp and a mercury level (or a placeholder for missing values) separated by a tab.
- It then parses each reading, separating the timestamp from the mercury level.
- If the mercury level is marked as missing (e.g., "Missing_1"), the index of this reading is stored for later processing, and `None` is appended to the `mercury_levels` list to represent the missing value. Otherwise, the mercury level is converted to a float and stored.
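
The parsing steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the helper name `parse_readings`, the `timestamps` and `missing_indices` lists, and the sample timestamp format are assumptions made for the example.

```python
def parse_readings(readings):
    """Split each 'timestamp<TAB>value' row into parallel lists.

    Hypothetical helper: the name and return shape are assumptions.
    """
    timestamps = []
    mercury_levels = []
    missing_indices = []
    for index, row in enumerate(readings):
        # Each row holds two tab-separated fields.
        timestamp, value = row.strip().split("\t")
        timestamps.append(timestamp)
        if value.startswith("Missing"):
            # Remember where the gap is and keep a placeholder in the series.
            missing_indices.append(index)
            mercury_levels.append(None)
        else:
            mercury_levels.append(float(value))
    return timestamps, mercury_levels, missing_indices


# Sample rows; the timestamp format here is assumed for illustration only.
sample = [
    "1/3/2012 16:00:00\t26.96",
    "1/4/2012 16:00:00\tMissing_1",
    "1/5/2012 16:00:00\t27.47",
]
timestamps, mercury_levels, missing_indices = parse_readings(sample)
print(mercury_levels)   # [26.96, None, 27.47]
print(missing_indices)  # [1]
```

Keeping `None` in place of each missing reading preserves the row positions, so the stored indices can be used directly when searching for neighbours later.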

### Estimating Missing Values

- For each missing value, the script calculates an estimate using a simple form of linear interpolation:
  - It finds the nearest non-missing mercury levels before and after the missing value.
  - It averages these two values to estimate the missing mercury level (the midpoint between the two neighbours).
- If there is no non-missing value on one side of the missing value, the script uses the available non-missing value on the other side as the estimate. (In practice, this situation should not occur if the data set starts and ends with non-missing values.)
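
As a rough sketch of the estimation described above (the function name `estimate_missing` and the helper `nearest_known` are hypothetical, and the real script may be organised differently), the neighbour search and averaging could look like this:

```python
def nearest_known(levels, start, step):
    """Walk from `start` in direction `step` (+1 or -1) to the first non-missing value."""
    i = start
    while 0 <= i < len(levels):
        if levels[i] is not None:
            return levels[i]
        i += step
    return None  # No known value on this side.


def estimate_missing(levels, missing_indices):
    """Estimate each missing entry from its nearest known neighbours (assumed helper)."""
    estimates = []
    for index in missing_indices:
        before = nearest_known(levels, index - 1, -1)
        after = nearest_known(levels, index + 1, +1)
        if before is not None and after is not None:
            estimates.append((before + after) / 2)  # Midpoint of the two neighbours.
        elif before is not None:
            estimates.append(before)  # Only an earlier value exists.
        elif after is not None:
            estimates.append(after)   # Only a later value exists.
        else:
            estimates.append(None)    # Nothing to estimate from.
    return estimates


print(estimate_missing([10.0, None, 20.0], [1]))            # [15.0]
print(estimate_missing([10.0, None, None, 40.0], [1, 2]))   # [25.0, 25.0]
```

Note that with plain averaging, consecutive missing days all receive the same estimate, as the second call shows. A distance-weighted interpolation would instead give different values for each day in the gap.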

### Output

- The estimated mercury levels for the missing entries are printed to the console, formatted as floats with two decimal places.
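
A minimal sketch of the output step, assuming the estimates are held in a list and that an entry of `None` stands for a value that could not be estimated (the helper name `format_estimates` is hypothetical; the `"Missing"` fallback follows the instructions above):

```python
def format_estimates(estimates):
    """Render each estimate with two decimal places, or 'Missing' if unavailable."""
    return [
        f"{value:.2f}" if value is not None else "Missing"
        for value in estimates
    ]


for line in format_estimates([26.5, None, 31.0]):
    print(line)
# 26.50
# Missing
# 31.00
```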

## Running the Script

The script is intended to be run with Python 3. Ensure the data file `data/input000.txt` is available, that is, a file named `input000.txt` inside a `data` folder located in the same directory as the script. This file should contain the readings, starting with the number of readings on the first line followed by each reading on its own line.
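
For illustration, reading a file in that layout might look like the sketch below. The function names `split_header` and `load_readings` are assumptions, not taken from the repository, and the path is resolved relative to the current working directory.

```python
from pathlib import Path


def split_header(lines):
    """Use the count on the first line to select the reading rows that follow."""
    count = int(lines[0].strip())
    return lines[1:count + 1]


def load_readings(path):
    """Read the data file and return only the reading rows (assumed helper)."""
    return split_header(Path(path).read_text().splitlines())


# Only attempt to load the file if it is present in the working directory.
data_file = Path("data/input000.txt")
if data_file.exists():
    readings = load_readings(data_file)
    print(len(readings), "readings loaded")
```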

To run the script, simply execute:

```bash
python3 main.py
```
