CandidateTracks
CandidateTracks(self, sound=None, samples=None, sampling_frequency=None, xmin=0.0, min_max_formant=4000, max_max_formant=7000, nstep=20, n_formants=4, window_length=0.025, time_step=0.002, pre_emphasis_from=50, smoother=Smoother(), loss_fun=Loss(), agg_fun=Agg(), heuristics=[])
A class for candidate tracks for a single formant
You can provide either
- A parselmouth Soundobject to thesoundargument
xor
- An array of audio samples to the samplesargument
- The sampling frequency to the sampling_frequencyargument
- Any optional time offset to the xminargument.
If a Sound object is passed to sound, any values passed to samples, sampling_frequency and xmin are ignored.
Parameters
| Name | Type | Description | Default | 
|---|---|---|---|
| sound | pm.Sound | A parselmouth.Soundobject. | None | 
| samples | np.ndarray | A numpy array of audio samples. | None | 
| sampling_frequency | float | The audio sampling frequency. | None | 
| xmin | float | The time offset for the audio. Defaults to 0.0. | 0.0 | 
| min_max_formant | float | The lowest max-formant value to try. Defaults to 4000. | 4000 | 
| max_max_formant | float | The highest max formant to try. Defaults to 7000. | 7000 | 
| nstep | int | The number of steps from the min to the max max formant. Defaults to 20. | 20 | 
| n_formants | int | The number of formants to track. Defaults to 4. | 4 | 
| window_length | float | Window length of the formant analysis. Defaults to 0.025. | 0.025 | 
| time_step | float | Time step of the formant analyusis window. Defaults to 0.002. | 0.002 | 
| pre_emphasis_from | float | Pre-emphasis threshold. Defaults to 50. | 50 | 
| smoother | Smoother | The smoother method to use. Defaults to Smoother(). | Smoother() | 
| loss_fun | Loss | The loss function to use. Defaults to Loss(). | Loss() | 
| agg_fun | Agg | The loss aggregation function to use. Defaults to Agg(). | Agg() | 
| heuristics | list[MinMaxHeuristic | SpacingHeuristic] | A list of formant tracking heuristics to use. | [] | 
Attributes
| Name | Type | Description | 
|---|---|---|
| candidates | list[OneTrack, …] | A list of OneTracktracks. | 
| smooth_errors | np.array | The error terms for each treack in candidates | 
| winner_idx | int | The candidate track with the smallest error term | 
| winner | OneTrack | The winning OneTracktrack. | 
| file_name | str | The filename of the audio file, if set. | 
| interval | aligned_textgrid.SequenceInterval | The textgrid interval of the sound, if set. | 
| id | str | The interval id of the sound, if set. | 
| group | str | The tier group name of the sound, if set. | 
Methods
| Name | Description | 
|---|---|
| spectrograms | This will plot a grid of the candidate formant | 
| to_df | Return a polars dataframe of the candidate tracks | 
spectrograms
CandidateTracks.spectrograms(**kwargs)
This will plot a grid of the candidate formant tracks and their spectrograms. If a file_name is provided, it will save the plot to disk.
Parameters
| Name | Type | Description | Default | 
|---|---|---|---|
| formants | int | The number of formants to plot. Defaults to 3. | required | 
| maximum_frequency | float | The frequency range the spectrogram and formants will be plotted up to. Defaults to 3500. | required | 
| dynamic_range | float | A all spectrogram values below the dynamic range. will be plotted as white. Defaults to 60. | required | 
| figsize | tuple[float, float] | Width and height of the figure in inches. Defaults to (8,5). | required | 
| file_name | Path| None | If provided, how to save the spectrogram. If not provided (None) the plot will show interactively. Defaults to None. | required | 
| dpi | float | If the plot is being saved, its image resolution in dots per inch. Defaults to 75 | required | 
to_df
CandidateTracks.to_df(which='winner', output='formants')
Return a polars dataframe of the candidate tracks
Parameters
| Name | Type | Description | Default | 
|---|---|---|---|
| which | Literal[‘winner’, ‘all’] | Return just the winner track data, or all candidates. Defaults to “winner”. | 'winner' | 
| output | Literal[‘formants’, ‘param’, ‘log_param’] | Whether to output the formants or the smoothing parameters. Defaults to “formants”. | 'formants' | 
Returns
| Type | Description | 
|---|---|
| pl.DataFrame | A polars.DataFrame | 
