**Update**:

Given a pandas Series:

    s = pd.Series([1,2,3,4], index=[&#39;a&#39;, &#39;b&#39;, &#39;c&#39;, &#39;d&#39;])
    
    s
    #a    1
    #b    2
    #c    3
    #d    4
    #dtype: int64

You can directly loop through it, which yield one value from the series in each iteration:

    for i in s:
        print(i)
    1
    2
    3
    4

If you want to access the index at the same time, you can use either `items` or `iteritems` method, which produces a generator that contains both the index and value:

    for i, v in s.items():
        print(&#39;index: &#39;, i, &#39;value: &#39;, v)
    #index:  a value:  1
    #index:  b value:  2
    #index:  c value:  3
    #index:  d value:  4
    
    for i, v in s.iteritems():
        print(&#39;index: &#39;, i, &#39;value: &#39;, v)
    #index:  a value:  1
    #index:  b value:  2
    #index:  c value:  3
    #index:  d value:  4

---

**Old Answer**:

You can call `iteritems()` method on the Series:

    for i, row in df.groupby(&#39;a&#39;).size().iteritems():
        print(i, row)

    # 12 4
    # 14 2

According to doc:

&gt; Series.iteritems()
&gt;
&gt; Lazily iterate over (index, value) tuples

Note: This is not the same data as in the question, just a demo.


To expand upon the answer of Psidom, there are three useful ways to unpack data from pd.Series. Having the same Series as Psidom: 

````s = pd.Series([1,2,3,4], index=[&#39;a&#39;, &#39;b&#39;, &#39;c&#39;, &#39;d&#39;])````

 - A direct loop over ````s```` yields the ```value``` of each row.
 -  A loop over
   ````s.iteritems()```` or ````s.items()```` yields a tuple with the ```(index,value)```
   pairs of each row. 
 - Using ````enumerate()```` on ````s.iteritems()```` yields a
   nested tuple in the form of: ````(rownum,(index,value))````. 

The last way is useful in case your index contains other information than the row number itself (e.g. in a case of a timeseries where the index is time).

    s = pd.Series([1,2,3,4], index=[&#39;a&#39;, &#39;b&#39;, &#39;c&#39;, &#39;d&#39;])

    for rownum,(indx,val) in enumerate(s.iteritems()):
        print(&#39;row number: &#39;, rownum, &#39;index: &#39;, indx, &#39;value: &#39;, val)

will output: 

    row number:  0 index:  a value:  1
    row number:  1 index:  b value:  2
    row number:  2 index:  c value:  3
    row number:  3 index:  d value:  4

You can read more on unpacking nested tuples [here][1].


  [1]: http://stackoverflow.com/questions/3331643/python-unpacking-an-inner-nested-tuple-list-while-still-getting-its-index-numbe

I&#39;m new to nodejs. I’m not seeing the response in ex 1, but i see in ex 2. Why? Await works for me in other places, using babel.

Ex 1

     let res = await request(url)
     console.log(res);
     console.log(res.body);

Ex 2

    request(url, function (error, res, body) {
     if (!error &amp;&amp; response.statusCode == 200) {
     console.log(body) 
     }
    });

Await works in other places, I’m using babel and required modules for es6 and es7 features. For example, await works in squelize call, i validated. But it doesn’t work for request call. Why?

Why await is not working for node request module?

I started to program Angular 2 and I stuck with an error: 

&gt; ts1206 decorators are not valid here


    @Component({   //  ts1206 decorators are not valid here
      selector: &#39;my-app&#39;,
      moduleId: module.id,
      templateUrl: &#39;app.component.html&#39;,
      styleUrls: [&#39;app.component.css&#39;]
    })

**Update:**

My tsconfig.json:

     {
      &quot;compilerOptions&quot;: {
        &quot;target&quot;: &quot;es5&quot;,
        &quot;module&quot;: &quot;commonjs&quot;,
        &quot;moduleResolution&quot;: &quot;node&quot;,
        &quot;sourceMap&quot;: true,
        &quot;emitDecoratorMetadata&quot;: true,
        &quot;experimentalDecorators&quot;: true,
        &quot;removeComments&quot;: false,
        &quot;noImplicitAny&quot;: true,
        &quot;suppressImplicitAnyIndexErrors&quot;: true
      }
    }

what can I do with it? 

    


ts1206 decorators are not valid here, Angular 2

How do you iterate over a Pandas Series generated from a `.groupby(&#39;...&#39;).size()` command and get both the group name and count.

As an example if I have:

    foo
    -1     7
     0    85
     1    14
     2     5

how can I loop over them so that in each iteration I would have -1 &amp; 7, 0 &amp; 85, 1 &amp; 14 and 2 &amp; 5 in variables?  

I tried the enumerate option but it doesn&#39;t quite work.  Example:

    for i, row in enumerate(df.groupby([&#39;foo&#39;]).size()):
    	print(i, row)

it doesn&#39;t return -1, 0, 1, and 2 for `i` but rather 0, 1, 2, 3.

How to iterate over Pandas Series generated from groupby().size()

How do you iterate over a Pandas Series generated from a <code>.groupby('...').size()</code> command and get both the group name and count.
As an example if I have:
<pre><code class="hljs language-diff">foo
-1 7
 0 85
 1 14
 2 5
</code></pre>
how can I loop over them so that in each iteration I would have -1 &#x26; 7, 0 &#x26; 85, 1 &#x26; 14 and 2 &#x26; 5 in variables?
I tried the enumerate option but it doesn't quite work. Example:
<pre><code class="hljs language-css">for i, row in enumerate(df.groupby(['foo']).size()):
	print(i, row)
</code></pre>
it doesn't return -1, 0, 1, and 2 for <code>i</code> but rather 0, 1, 2, 3.

I&#39;m quite familiar with Django, but I recently noticed there exists an `on_delete=models.CASCADE` option with the models. I have searched for the documentation for the same, but I couldn&#39;t find anything more than:

&gt; **Changed in Django 1.9:**
&gt;
&gt; `on_delete` can now be used as the second positional argument (previously it was typically only passed as a keyword argument). It will be a required argument in Django 2.0.

[An example case of usage is][1]:

    from django.db import models

    class Car(models.Model):
        manufacturer = models.ForeignKey(
            &#39;Manufacturer&#39;,
            on_delete=models.CASCADE,
        )
        # ...

    class Manufacturer(models.Model):
        # ...
        pass

What does on_delete do? (*I guess the actions to be done if the model is deleted*.)

What does `models.CASCADE` do? (*any hints in documentation*)

What other options are available (*if my guess is correct*)?

Where does the documentation for this reside?

  [1]: https://docs.djangoproject.com/en/stable/ref/models/fields/#django.db.models.ForeignKey





What does on_delete do on Django models?

There&#39;s a DataFrame in pyspark with data as below:

    user_id object_id score
    user_1  object_1  3
    user_1  object_1  1
    user_1  object_2  2
    user_2  object_1  5
    user_2  object_2  2
    user_2  object_2  6

What I expect is returning 2 records in each group with the same user_id, which need to have the highest score. Consequently, the result should look as the following:

    user_id object_id score
    user_1  object_1  3
    user_1  object_2  2
    user_2  object_2  6
    user_2  object_1  5
    
I&#39;m really new to pyspark, could anyone give me a code snippet or portal to the related documentation of this problem? Great thanks!



Retrieve top n in each group of a DataFrame in pyspark

I want to install the &#39;rope&#39; package in my current active environment using conda. Currently, the following &#39;rope&#39; versions are available:

    (data_downloader)user@user-ThinkPad ~/code/data_downloader $ conda search rope
    Using Anaconda Cloud api site https://api.anaconda.org
    Fetching package metadata: ....
    cached-property              1.2.0                    py27_0  defaults        
                                 1.2.0                    py34_0  defaults        
                                 1.2.0                    py35_0  defaults        
                                 1.3.0                    py27_0  defaults        
                                 1.3.0                    py34_0  defaults        
                                 1.3.0                    py35_0  defaults        
    rope                         0.9.4                    py26_0  defaults        
                                 0.9.4                    py27_0  defaults        
                                 0.9.4                    py33_0  defaults        
                                 0.9.4                    py34_0  defaults        
                                 0.9.4                    py26_1  defaults        
                                 0.9.4                    py27_1  defaults        
                                 0.9.4                    py33_1  defaults        
                                 0.9.4                    py34_1  defaults        
                              .  0.9.4                    py35_1  defaults        


I would like to install the following one:

                             1.3.0                    py35_0  defaults        

I&#39;ve tried all sorts of permutations of &#39;conda install&#39; which I&#39;m not going to list here because none of them are correct.

I am also not sure what the *py35_0* is (I&#39;m assuming this is the version of the python against which the package was built?) and I also don&#39;t know what &#39;defaults&#39; means?

anaconda/conda - install a specific package version

According to the [Python 2.7.12 documentation][1]:

&gt; `!s` (apply `str()`) and `!r` (apply `repr()`) can be used to convert
&gt; the value before it is formatted.
&gt; 
&gt;     &gt;&gt;&gt; import math
&gt;     &gt;&gt;&gt; print &#39;The value of PI is approximately {}.&#39;.format(math.pi)
&gt;     The value of PI is approximately 3.14159265359.
&gt;     &gt;&gt;&gt; print &#39;The value of PI is approximately {!r}.&#39;.format(math.pi)
&gt;     The value of PI is approximately 3.141592653589793.

Interestingly, the converted value is the output of `repr()`, rather than `str()`.

    &gt;&gt;&gt; str(math.pi)
    &#39;3.14159265359&#39;
    &gt;&gt;&gt; repr(math.pi)
    &#39;3.141592653589793&#39;

So what does &quot;convert the value&quot; mean here? Making it less human-readable?


  [1]: https://docs.python.org/2/tutorial/inputoutput.html#fancier-output-formatting

What does !r do in str() and repr()?

After reading Eli Bendersky&#39;s article [on implementing state machines via Python coroutines](http://eli.thegreenplace.net/2009/08/29/co-routines-as-an-alternative-to-state-machines/) I wanted to...

- see his example run under Python3
- and also add the appropriate type annotations for the generators

I succeeded in doing the first part (*but without using `async def`s or `yield from`s, I basically just ported the code - so any improvements there are most welcome*). 

But I need some help with the type annotations of the coroutines:

    #!/usr/bin/env python3

    from typing import Callable, Generator

    def unwrap_protocol(header: int=0x61,
                        footer: int=0x62,
                        dle: int=0xAB,
                        after_dle_func: Callable[[int], int]=lambda x: x,
                        target: Generator=None) -&gt; Generator:
        &quot;&quot;&quot; Simplified protocol unwrapping co-routine.&quot;&quot;&quot;
        #
        # Outer loop looking for a frame header
        #
        while True:
            byte = (yield)
            frame = []  # type: List[int]

            if byte == header:
                #
                # Capture the full frame
                #
                while True:
                    byte = (yield)
                    if byte == footer:
                        target.send(frame)
                        break
                    elif byte == dle:
                        byte = (yield)
                        frame.append(after_dle_func(byte))
                    else:
                        frame.append(byte)


    def frame_receiver() -&gt; Generator:
        &quot;&quot;&quot; A simple co-routine &quot;sink&quot; for receiving full frames.&quot;&quot;&quot;
        while True:
            frame = (yield)
            print(&#39;Got frame:&#39;, &#39;&#39;.join(&#39;%02x&#39; % x for x in frame))

    bytestream = bytes(
        bytearray((0x70, 0x24,
                   0x61, 0x99, 0xAF, 0xD1, 0x62,
                   0x56, 0x62,
                   0x61, 0xAB, 0xAB, 0x14, 0x62,
                   0x7)))

    frame_consumer = frame_receiver()
    next(frame_consumer)  # Get to the yield

    unwrapper = unwrap_protocol(target=frame_consumer)
    next(unwrapper)  # Get to the yield

    for byte in bytestream:
        unwrapper.send(byte)

This runs properly...

    $ ./decoder.py 
    Got frame: 99afd1
    Got frame: ab14

...and also typechecks:

    $ mypy --disallow-untyped-defs decoder.py 
    $

But I am pretty sure I can do better than just use the `Generator` base class in the type specs (just as I did for the `Callable`). I know it takes 3 type parameters (`Generator[A,B,C]`), but I am not sure how exactly they&#39;d be specified here.

Any help most welcome.


Proper type annotation of Python functions with yield

I have two tables and I would like to append them so that only all the data in table A is retained and data from table B is only added if its key is unique (Key values are unique in table A and B however in some cases a Key will occur in both table A and B). 

I think the way to do this will involve some sort of filtering join (anti-join) to get values in table B that do not occur in table A then append the two tables. 

I am familiar with R and this is the code I would use to do this in R.

    library(&quot;dplyr&quot;)

    ## Filtering join to remove values already in &quot;TableA&quot; from &quot;TableB&quot;
    FilteredTableB &lt;- anti_join(TableB,TableA, by = &quot;Key&quot;)

    ## Append &quot;FilteredTableB&quot; to &quot;TableA&quot;
    CombinedTable &lt;- bind_rows(TableA,FilteredTableB)

How would I achieve this in python?









Anti-Join Pandas

I have a spreadsheet like this:

    Locality	2005	2006	2007	2008	2009
					
    ABBOTSFORD	427000	448000	602500	600000	638500
    ABERFELDIE	534000	600000	735000	710000	775000
    AIREYS INLET459000	440000	430000	517500	512500

I don&#39;t want to manually swap the column with the row. Could it be possible        to use pandas reading data to a list as this:

    data[&#39;ABBOTSFORD&#39;]=[427000,448000,602500,600000,638500]
    data[&#39;ABERFELDIE&#39;]=[534000,600000,735000,710000,775000]
    data[&#39;AIREYS INLET&#39;]=[459000,440000,430000,517500,512500]



Could pandas use column as index?

`np.where` has the semantics of a vectorized if/else (similar to Apache Spark&#39;s `when`/`otherwise` DataFrame method). I know that I can use `np.where` on `pandas.Series`, but `pandas` often defines its own API to use instead of raw `numpy` functions, which is usually more convenient with `pd.Series`/`pd.DataFrame`.

Sure enough, I found `pandas.DataFrame.where`. However, at first glance, it has completely different semantics. I could not find a way to rewrite the most basic example of `np.where` using pandas `where`:

    # df is pd.DataFrame
    # how to write this using df.where?
    df[&#39;C&#39;] = np.where((df[&#39;A&#39;]&lt;0) | (df[&#39;B&#39;]&gt;0), df[&#39;A&#39;]+df[&#39;B&#39;], df[&#39;A&#39;]/df[&#39;B&#39;])

Am I missing something obvious? Or is pandas&#39; `where` intended for a completely different use case, despite same name as `np.where`? 

pandas equivalent of np.where

I have two pandas dataframes and I would like to display them in Jupyter notebook.

Doing something like: 

    display(df1)
    display(df2)

Shows them one below another:

[![enter image description here][1]][1]

I would like to have a second dataframe on the right of the first one. There is [a similar question][2], but it looks like there a person is satisfied either with merging them in one dataframe of showing the difference between them.

This will not work for me. In my case dataframes can represent completely different (non-comparable elements) and the size of them can be different. Thus my main goal is to save space.


  [1]: http://i.stack.imgur.com/e9jdGm.png
  [2]: https://stackoverflow.com/q/35790922/1090562

Jupyter notebook display two pandas tables side by side

I have this simplified dataframe:
    
    ID   Fruit
    F1   Apple
    F2   Orange
    F3   Banana 

I want to add in the begining of the dataframe a new column `df[&#39;New_ID&#39;]`  which has the number `880` that increments by one in each row.

The output should be simply like:

    New_ID   ID   Fruit
    880      F1   Apple
    881      F2   Orange
    882      F3   Banana  

I tried the following:
   
    df[&#39;New_ID&#39;] = [&quot;880&quot;] # but I want to do this without assigning it the list of numbers literally

Any idea how to solve this?

Thanks!

Content Type	Original Author	Original Content on Stackoverflow
Question	Reily Bourne	View Question on Stackoverflow
Solution 1 - Python	Psidom	View Answer on Stackoverflow
Solution 2 - Python	dbouz	View Answer on Stackoverflow

How to iterate over Pandas Series generated from groupby().size()

Python Problem Overview

Python Solutions

Solution 1 - Python

Solution 2 - Python

ts1206 decorators are not valid here, Angular 2

Why await is not working for node request module?

Attributions