Recording, modifying and playing audio on iOS

后端 未结 2 1595
滥情空心
滥情空心 2021-01-28 03:33

EDIT: In the end I used exactly as I explained below, AVRecorder for recording the speech and openAL for the pitch shift and playback. It worked out quite well.

相关标签:
2条回答
  • 2021-01-28 04:19

    You can either use AVRecorder or something lower like the realtime IO audio unit.

    The concept of 'volume' is pretty vague. You might want to look at the difference between calculating peak and RMS values, and understanding how to integrate an RMS value over a given time (say 300ms which is what a VU meter uses).

    Basically you sum all the squares of the values. You would take the square root and convert to dBFS with 10 * log10f(sqrt(sum/num_samples)), but you can do that without the sqrt in one step with 20 * log10f(sum/num_samples).

    You'll need to do a lot of adjusting of integration times and thresholds to get it to behave the way you want.

    For pitch shifting, I think OpenAL with do the trick, the technique behind it is called band limited interpolation - https://ccrma.stanford.edu/~jos/resample/Theory_Ideal_Bandlimited_Interpolation.html

    This example shows a rms calculation as a running average. The circular buffer maintains a history of squares, and eliminates the need to sum the squares every operation. I haven't run it so treat it as pseudo code ;)

    Example:

    class VUMeter
    {
    
    protected:
    
        // samples per second
        float _sampleRate;
    
        // the integration time in seconds (vu meter is 300ms)
        float _integrationTime;
    
        // these maintain a circular buffer which contains
        // the 'squares' of the audio samples
    
        int _integrationBufferLength;
        float *_integrationBuffer;
        float *_integrationBufferEnd;
        float *_cursor;
    
        // this is a sort of accumulator to make a running
        // average more efficient
    
        float _sum;
    
    public:
    
        VUMeter()
        : _sampleRate(48000.0f)
        , _integrationTime(0.3f)
        , _sum(0.)
        {
            // create a buffer of values to be integrated
            // e.g 300ms @ 48khz is 14400 samples
    
            _integrationBufferLength = (int) (_integrationTime * _sampleRate);
    
            _integrationBuffer = new float[_integrationBufferLength + 1];
            bzero(_integrationBuffer, _integrationBufferLength);
    
            // set the pointers for our ciruclar buffer
    
            _integrationBufferEnd = _integrationBuffer + _integrationBufferLength;
            _cursor = _integrationBuffer;
    
        }
    
        ~VUMeter()
        {
            delete _integrationBuffer;
        }
    
        float getRms(float *audio, int samples)
        {
            // process the samples
            // this part accumulates the 'squares'
    
            for (int i = 0; i < samples; ++i)
            {
                // get the input sample
    
                float s = audio[i];
    
                // remove the oldest value from the sum
    
                _sum -= *_cursor;
    
                // calculate the square and write it into the buffer
    
                double square = s * s;
                *_cursor = square;
    
                // add it to the sum
    
                _sum += square;
    
                // increment the buffer cursor and wrap
    
                ++_cursor;
    
                if (_cursor == _integrationBufferEnd)
                    _cursor = _integrationBuffer;
            }
    
            // now calculate the 'root mean' value in db
    
            return 20 * log10f(_sum / _integrationBufferLength);
        }
    };
    
    0 讨论(0)
  • 2021-01-28 04:25

    OpenAL resampling will change the pitch and the duration inversely. e.g. a sound resampled to a higher pitch will play for a shorter amount of time and thus faster.

    0 讨论(0)
提交回复
热议问题