Data Structure 1 - array

Background

Definition

An array data structure, is a data structure consisting of a collection of elements (values or variables), each identified by at least one array index or key.
An array is stored such that the position of each element can be computed from its index tuple by a mathematical formula.

For example, an array of 10 32-bit (4 bytes) integer variables, with indices 0 through 9, may be stored as 10 words at memory addresses 2000, 2004, 2008, … 2036, so that the element with index i has the address 2000 + (i × 4).

Array can be referred and implemented to:
a linear array, 1d array, vector, tuples, tables, lookup tables, …

Arrays are also used to implement many other data structures, such as:
lists, strings,

They effectively exploit the addressing logic of computers.

In Python, Array = list - resizable (append and delete)
In Java, Array - fixed length, ArrayList - dynamic resizing, TO(1) access (amortized)

subarray -
subsequence -
substring -

Basic Operations

find() - TO(n) - find an element, i.e. [arr[i] for i in range(arr) if i…]
find_first() - TO(1) - access first element, i.e. arr[0]
find_last() - TO(1) - access last element, i.e. arr[-1]
find_arbitrary() - TO(1) - access an element at an arbitrary index, i.e. arr[i]
insert_front() - TO(n) - insert to front
insert_back() - TO(logn)?? - insert to back =? append()
insert_arbitrary() - TO(n) - insert at an arbitrary index
delete_front() - TO(n)
delete_back() - TO(logn)
delete_arbitrary() - TO(n)
resize()??? - O(n) - resizing backing array???
len() - TO(logn)
isEmpty() - TO(1)
min()/max() - TO(n)??
kthLargest()/kthSmallest() - TO(n)/TO(n^2) (random pivot selection method), TO(n) (median of medians pivot selection method)
find_median() - TO(n) (kth selection)

Python List - built-in see python built-in #list

l.append(ele) - addition elements at the end of the list
l.insert(pos, ele) - addition elements at the desired position
l.extend(l2) - add list 2 to the end of list 1
sum(l) - only for numertic values
len(l)
l.count(ele) - calculates total occurrence of given element
l.index(ele, start, end) - return the index of the first occurrence
min(l) / max(l)
l.sort() / sorted(l) - sort in ascending order, descending order if reverse=True
l.reverse()
l.pop(idx) - take index, default: show and delete the last element
l.remove(ele) - delete the first occurrence
del - del l[idx], i.e. del l[3:5]
l.clear() - erase all
l.copy()

Implementation

Dynamic Array in Python see dynamic array in python

A dynamic array is similar to an array, but with the difference that its size can be dynamically modified at runtime.
A dynamic array can, once the array is filled, allocate a bigger chunk of memory, copy the contents from the original array to this new space, and continue to fill the available slots.

Steps:

create array with capacity=1, len start from 0

when appending new element, check if len==capacity
if len!=cap, make a new array with cap=2*cap, copy old arr eles to new
else arr[len]=new ele, increase len by 1

   old arr: 1 2 3 4            old arr with refs
            | | | |    
            r r r r

   new arr: _ _ _ _ _ _ _ _    creating new arr

   new arr: 1 2 3 4 _ _ _ _    store elements from old arr to new arr

   new arr: 1 2 3 4 _ _ _ _    reassign ref from old to new
            | | | |
            r r r r

import ctypes #c language types
class DynamicArray():

    def __init__(self):
        self.n=0
        self.capacity=1
        self.arr=self.make_array(self.capacity)

    def __len__(self):  #later check
        return self.n

    def __getitem__(self,i):
        if not 0<=i<self.n:
            return IndexError('i is out of bounds!')

        return self.arr[i]

    def append(self,ele):
        if self.n==self.capacity:
            self._resize(2*self.capacity)

        self.arr[self.n]=ele
        self.n+=1

    def _resize(self,new_size):
        new_arr=self.make_array(new_size)
        for i in range(self.n):
            new_arr[i]=self.arr[i]

        self.arr=new_arr
        self.capacity=new_size

    def make_array(self,new_size):
        return (new_size*ctypes.py_object)()

resizing factor??

Problems

Find Element

leetcode 287 - Find the Duplicate Number [M] - two pointers
Given an array nums containing n + 1 integers where each integer is between 1 and n (inclusive), prove that at least one duplicate number must exist. Assume that there is only one duplicate number, find the duplicate one.

Input: [1,3,4,2,2]
Output: 2
Input: [3,1,3,4,2]
Output: 3

Solution: TO(n)
Suppose no dups array [1,2,3]

0 1 2
1,2,3

idx       
0,arr[0]->1,arr[1]->2,arr[2]->3

idx ele idx ele idx ele
0 -> 1, 1 -> 2, 2 -> 3

Suppose dups array [1,3,3,2]

0 1 2 3
1,3,3,2

i    e i    e i    e i
0 -> 1,3 -> 2,3 -> 2,3
            |--------| circle

Steps:

move slow pointer 1 step and fast pointer 2 steps until collide
restart the fast from index 0 and move both pointers 1 step at a time

Details:

0 1 2 3 4
1,3,4,2,2
s f
  s f
    f s
    s/f

f       s
  f s
      f s  return 2

class Solution():
    def findDuplicate(self,nums):
        slow=nums[0]
        fast=nums[slow]

        while fast!=slow:
            slow=nums[slow]
            fast=nums[nums[fast]]

        fast=0
        while fast!=slow:
            slow=nums[slow]
            fast=nums[fast]

        return fast

leetcode 645 - Set Mismatch [E]
The set S originally contains numbers from 1 to n. But unfortunately, due to the data error, one of the numbers in the set got duplicated to another number in the set, which results in repetition of one number and loss of another number.
Given an array nums representing the data status of this set after the error. Your task is to firstly find the number occurs twice and then find the number that is missing. Return them in the form of an array.

Input: nums = [1,2,2,4]
Output: [2,3]

In-place Move

leetcode 26 - Remove Duplicates from Sorted Array [E]
Given a sorted array nums, remove the duplicates in-place such that each element appear only once and return the new length.
Do not allocate extra space for another array, you must do this by modifying the input array in-place with O(1) extra memory.

Given nums = [1,1,2],
Your function should return length = 2, with the first two elements of nums being 1 and 2 respectively.
Given nums = [0,0,1,1,1,2,2,3,3,4],
Your function should return length = 5, with the first five elements of nums being modified to 0, 1, 2, 3, and 4 respectively.

class Solution():
    def removeDuplicates(self,nums):
        i=0
        for j in range(len(nums)):
            if j==0 or nums[j]!=nums[j-1]:  #insert nums[0] in the beginning as always
                nums[i]=nums[j]
                i+=1

        return i

leetcode 27 - Remove Element [E]
Given an array nums and a value val, remove all instances of that value in-place and return the new length.
Do not allocate extra space for another array, you must do this by modifying the input array in-place with O(1) extra memory.
The order of elements can be changed. It doesn’t matter what you leave beyond the new length.

Given nums = [3,2,2,3], val = 3,
Your function should return length = 2, with the first two elements of nums being 2.
Given nums = [0,1,2,2,3,0,4,2], val = 2,
Your function should return length = 5, with the first five elements of nums containing 0, 1, 3, 0, and 4.

class Solution():
    def removeElement(self,nums,val):
        i=0
        for n in nums:
            if n!=val:
                nums[i]=n
                i+=1

        return i

leetcode 283 - Move Zeroes [E] - two pointers
Given an array nums, write a function to move all 0’s to the end of it while maintaining the relative order of the non-zero elements.

Input: [0,1,0,3,12]
Output: [1,3,12,0,0]

class Solution():
    def moveZeroes(self,nums):
        i=0
        for n in nums:
            if n!=0:
                nums[i]=n
                i+=1

        nums[i:]=[0]*(len(nums)-i)

leetcode 189 - Rotate Array [E]
Given an array, rotate the array to the right by k steps, where k is non-negative.
Example:
Input: [1,2,3,4,5,6,7] and k = 3
Output: [5,6,7,1,2,3,4]
Explanation:
rotate 1 steps to the right: [7,1,2,3,4,5,6]
rotate 2 steps to the right: [6,7,1,2,3,4,5]
rotate 3 steps to the right: [5,6,7,1,2,3,4]

class Solution():
    def rotate(self,k,nums):
        #in-place
        n=len(nums)
        nums[:]=nums[-k%n:]+nums[:n-k%n]

Subsequence/Subarray

leetcode 128 - Longest Consecutive Sequence [H]
Input: [100, 4, 200, 1, 3, 2]
Output: 4
Explanation: The longest consecutive elements sequence is [1, 2, 3, 4]. Therefore its length is 4.

Solution:

make a set for removing the repeated
filter out non-head numbers using if-continue
find the head number (which is 1 from the example) and collect the consecutive

class Solution():
    def longestConsecutive(self, nums):
        nset=set(nums)
        longest=0
        for n in nset:
            if n-1 in nset:
                continue      #filter out non-head numbers
            seq=0
            while n in nset:  #find the head number and collect the consecutive
                seq+=1
                n+=1
            longest=max(longest,seq)

        return longest               

leetcode 152 - Maximum Product Subarray [M] - minmax
Example 1:
Input: [2,3,-2,4]
Output: 6
Explanation: [2,3] has the largest product 6

Example 2:
Input: [-2,0,-1]
Output: 0
Explanation: The result cannot be 2, cuz [-2,-1] is not a subarray

Solution:
Calculate the most positive and most negative subarrary products ending at each element
Either the element alone or multiplied by previous most positive and most negative

[4,-2,2,3] output:

     n,   most_pos*n,        most_neg*n  
     4,     1*4=4,              1*4=4
    -2,    4*-2=-8,            4*-2=-8
     2,    -2*2=-4,            -8*2=-16   
          [-2=max(-2,-8,-8)]  [-8=min(-2,-8,-8)]
     3,     2*3=6,            -16*3=-48
           [2=max(2,-4,-16)] [-16=min(2,-4,-16)]

class Solution():
    def maxProduct(self,nums):
        largest_product=float('-inf')
        most_p,most_n=1,1 #most positive and most negative
        for n in nums:
            most_p,most_n=max(n,most_p*n,most_n*n),min(n,most_p*n,most_n*n)
            largest_product=max(largest_product,most_p,most_n)

        return largest_product

leetcode 209 - Minimum Size Subarray Sum [M] - two pointers / binary search
Given an array of n positive integers and a positive integer s, find the minimal length of a contiguous subarray of which the sum ≥ s. If there isn’t one, return 0 instead.

Example:
Input: s = 7, nums = [2,3,1,2,4,3]
Output: 2
Explanation: the subarray [4,3] has the minimal length under the problem constraint.
Follow up:
O(n) - two pointers
O(n log n) - binary search

Solution 1: two pointers

set left,right pointers at the beginning which is 0 index
right++ and sum up all before right
when sum>=target, len=min(len,right-left+1), len_ini=float(‘inf’)
start moving left, left++ and sum-=nums[left]

class Solution():
    def maxSubArrayLen(self,s,nums):
        left,right=0,0
        sum=0
        res=float('inf')
        while right<len(nums):
            sum+=nums[right]

            while sum>=s:
                res=min(res,right-left+1)
                sum-=nums[left]
                left+=1

            right+=1

        return res if res!=float('inf') else 0

Solution 2: binary search

make a sum list from 0 len=len(nums)+1
l from index 1 of suml, search the suml element > s, return len=ele_index-l+1
l from index 2 of suml, search the suml element-suml[l] > s, return l

…

 origin 2, 3, 1, 2, 4, 3         s=7
 index  0  1  2  3  4  5  6   
 suml   2  5  6  8  12 15   find target=suml[0]-nums[0]+s=2-2+7=7
        l        ^          8>7
           l        ^       find target=suml[1]-nums[1]+s=5-3+7=9
              l     ^       find target=suml[2]-nums[2]+s=6-1+7=12
                 l     ^    find target=suml[3]-nums[3]+s=8-2+7=13
                    l  ^    find target=suml[4]-nums[4]+s=12-4+7=15

class Solution():
    def minSubArrayLen(self,s,nums):
        suml=[n for n in nums]
        for i in range(1,len(nums)):
            suml[i]=suml[i-1]+nums[i]

        res=float('inf')
        for i in range(len(suml)):
            #the position of suml > target
            pos=self.binarysearch(i,len(suml),suml,suml[i]-nums[i]+s)
            if pos<len(suml):
                res=min(res,pos-i+1)

        return res if res!=float('inf') else 0

    def binarysearch(self,left,right,suml,target):

        while left<right:
            mid=(left+right)//2
            if suml[mid]<target:
                left=mid+1
            else:
                right=mid

        return left      

leetcode 259 - 3Sum Smaller [M] - two pointers
Given an array of n integers nums and a target,
Find the number of index triplets i,j,k with 0<=i<j<k<n that satisfy the condition
nums[i]+nums[j]+nums[k]<target

Input: nums=[-2,0,1,3], target=2
Output: 2
Explanation: there are two triplets sums less than 2, [-2,0,1] and [-2,0,3]

Solution of [-2,0,1,3,4,-1,2],target=3:
most minimum two from start + max < target means indices between max all valid

[-2,0,1,3,4,-1,2]
sorted
  0  1 2 3 4 5 6
[-2,-1,0,1,2,3,4]
  i  l         r

sum([-2,-1,4])<3
means [-2,-1,0][-2,-1,1][-2,-1,2][-2,-1,3][-2,-1,4] all count

class Solution():
    def threeSumSmaller(self,nums,target):
        cnt=0
        nums.sort()

        for i in range(len(nums)-2):
            l,r=i+1,len(nums)-1
            while l<r:
                if nums[i]+nums[l]+nums[r]<target:
                    cnt+=r-l
                    l+=1
                else:
                    r-=1

        return cnt

leetcode 53 - Maximum Subarray
leetcode - Maximum Size Subarray Sum Equals k
leetcode 560 - Subarray Sum Equals K
leetcode 718 - Maximum Length of Repeated Subarray [M]
leetcode 713 - Subarray Product Less Than K []

Range

leetcode 228 - Summary Ranges [M]
Given a sorted integer array without duplicates, return the summary of its ranges.

Example 1:
Input: [0,1,2,4,5,7]
Output: [“0->2”,”4->5”,”7”]
Explanation: 0,1,2 form a continuous range; 4,5 form a continuous range.

Example 2:
Input: [0,2,3,4,6,8,9]
Output: [“0”,”2->4”,”6”,”8->9”]
Explanation: 2,3,4 form a continuous range; 8,9 form a continuous range.

Solution 1:
brute force scanning, but be careful of the final edge case

class Solution():
    def summaryRanges(self,nums):
        res=[]
        start,l=0,0
        while l<len(nums):
            if l!=len(s)-1 and nums[l+1]-nums[l]==1:
                l+=1
            elif l==len(nums)-1: #last element or continuous elements
                if start==l:
                    res.append(str(nums[start]))
                else:
                    res.append(str(nums[start])+'->'+str(nums[l]))
                break
            else:
                if start==l:
                    res.append(str(nums[start]))
                else:
                    res.append(str(nums[start])+'->'+str(nums[l]))

                start=l+1
                l+=1

        return res

Solution 2:
record [start,end] pairs
if emtpy or not continue: append [n,n]
if continue: change the last bit to [n,n+1]

[0,1,2,4,5,7]

if emtpy or not continue:  if continue:
    [0,0]                     [0,1]
                              [0,2]
    [4,4]                     [4,5]
    [7,7]                     [7,7]

class Solution():
    def summaryRanges(self,nums):
        summary=[]
        for n in nums:
            if not summary or n>summary[-1][1]+1:
                summary.append([n,n])
            else:
                summary[-1][1]=n

        return [str(i) if i==j else str(i)+'->'+str(j) for i,j in summary]

Dislocation

leetcode 238 - Product of Array Except Self [M]
Given an array nums of n integers where n>1, return an array output such that output[i] is equal to the product of all the elements of this array except nums[i]

Example:
Input: [1,2,3,4]
Output: [24,12,8,6]

Do it without division and in TO(n)

Solution:
cross productiong from both left and right side

left side product
[1]
  \
[1,1,2,6]

[1x2]
    \
[1,1,2,6]

[1x2x3]
      \
[1,1,2,6]

right side product

      [4]
      /
[1,1,2,6]
[1,1,8,6]

    [3x4]
    /
[1,12,8,6]

  [2x3x4]
  /
[24,12,8,6]

class Solution():
    def productExceptSelf(self,nums):
        prod=[1]
        for i in range(1,len(nums)):
            prod[i]=nums[i]*prod[i-1]

        right=1
        for i in range(len(nums)-1,-1,-1):
            prod[i]*=right
            right*=nums[i]

        return prod

Flatten

flatten a 2D list

use list comprehension

l=[[1, 2, 3], [3, 6, 7], [7, 5, 4]]
f=[i for sub in l for i in sub]

use chain.iterable()

from itertools import chain
l=[[1, 2, 3], [3, 6, 7], [7, 5, 4]]
f=list(chain.from_iterable(l))
>>f
[1, 2, 3, 3, 6, 7, 7, 5, 4]

use functools.reduce (check later!!)

from functools import reduce
l=[[1, 2, 3], [3, 6, 7], [7, 5, 4]]
f=reduce(lambda z,y:z+y,l)

use numpy

a=np.array([[1, 2, 3], [3, 6, 7], [7, 5, 4]])
a.flatten()
>>array([1, 2, 3, 3, 6, 7, 7, 5, 4])

flatten a nested list - recursion
Input:
l = [1, 2, [3, 4, [5, 6]], 7, 8, [9, [10]]]
Output:
l = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

l=[1,2,[3,4,[5,6]],7,8,[9,[10]]]  
res=[]
def flatten(l):
    for i in l:
        if type(i)==list:
            flatten(i)
        else:
            res.append(i)

>>res
[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

leetcode 251 - Flatten 2D Vector [M]
Implement an iterator to flatten a 2d vector.

Input: [[1,2],[3],[4,5,6]]
Output: [1,2,3,4,5,6]

Explanation:
By calling next repeatedly until hasNext returns False
The order of elements returned by next should be:
[1,2,3,4,5,6]

Solution:
use two pointers l for each sublist, i for each element in each sublist
l,i=0,0, increase i and l in every step

class Vector2D():
    def __init__(self,vec):
        self.vec=vec
        self.l,self.i=0,0

        #if empty sublist
        while self.l<len(self.vec) and len(self.vec[self.l])==0:
            self.l+=1

    def next(self):
        res=self.vec[self.l][self.i]
        if self.i<len(self.vec[self.l])-1:  #still in current sub list
            self.i+=1
        else:
            self.i=0
            self.l+=1
            #if empty sublist
            while self.l<len(self.vec) and len(self.vec[self.l])==0:
                self.l+=1

        return res

    def hasNext(self):
        return self.l<len(self.vec)

v=Vector2D([[1,2],[3],[4,5,6]])
print(v.hasNext())
print(v.next())
print(v.next())
print(v.next())
print(v.next())
print(v.next())
>>
True
1
2
3
4
5

leetcode 341 - Flatten Nested List Iterator [M]
Given a nested list of integers, implement an iterator to flatten it.
Each element is either an integer, or a list – whose elements may also be integers or other lists.

Example 1:
Input: [[1,1],2,[1,1]]
Output: [1,1,2,1,1]
Explanation: By calling next repeatedly until hasNext returns false, the order of elements returned by next should be: [1,1,2,1,1].

Example 2:
Input: [1,[4,[6]]]
Output: [1,4,6]
Explanation: By calling next repeatedly until hasNext returns false, the order of elements returned by next should be: [1,4,6].

Solution:
first transfer to list and then using pop

class NestedIterator():
    def __init__(self,nestedList):
        self.res=[]
        self.flatten(nestedList)
        self.res=self.res[::-1]

    def flatten(self,l):
        for i in l:
            if type(i)==list:
                self.flatten(i)
            else:
                self.res.append(i)

    #should use built-in functions like following
    #isInteger, getList
    def flatten(self,l):
        for i in l:
            if i.isInteger():
                self.res.append(i)
            else:
                self.flatten(i.getList())

    def next(self):

        return self.res.pop()

    def hasNext(self):

        return len(self.res)!=0

n=NestedIterator([[1,1],2,[1,1]])
print(n.next())
print(n.next())
print(n.next())
print(n.hasNext())
print(n.next())
print(n.next())
print(n.hasNext())

Permutation

leetcode 31 - Next Permutation [E]
Implement next permutation, which rearranges numbers into the lexicographically next greater permutation of numbers.
If such arrangement is not possible, it must rearrange it as the lowest possible order (ie, sorted in ascending order).

The replacement must be in-place and use only constant extra memory.

1,2,3 → 1,3,2  
3,2,1 → 1,2,3  
1,1,5 → 1,5,1  

Solution of 1,2,7,4,3,1:

find the top descending idx r from right most
partially reverse the nums[r:]
find the first element in nums[r:] > r-1
swap nums[r-1] with the found nums